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PNEUMOCOCCAL GENES, PORTIONS THEREOF, 
EXPRESSION PRODUCTS THEREFROM, 
AND USES OF SUCH GENES, PORTIONS AND PRODUCTS 

RELATED APPLICATIONS 

This application is a continuation-in-part ( ,f CIP M ) : of 
application Serial Nos. 08,529,055, filed September 15, 1995, 
08/226,844, filed May 29, 1992, 08/093,907, filed May 29, 1992, 
07/884,918, filed July 5, 1994 (corresponding to PCT/US93/05191) ; 
of application Serial No. 08/482,981, filed June 7, 1995; of 
application Serial No. 08/458,399, filed June 2, 1995; of 
application Serial No. 08/446,201, filed May 19, 1995 (as a CIP 
of USSN 08/246,636); of application Serial No. 08/246,636, filed 
May 20, 1994 (as a CIP of USSN 08/048,896, filed April 20, 1993 
as a CIP of USSN 07/835,698, filed February 12, 1992 as a CIP of 
USSN 07/656,773); of application Serial 08/319,795, filed October 
7, 1994 (as a CIP of USSN 08/246,636); of application Serial No. 
08/072,070, filed June 3, 1993; of application Serial No. 
07/656,773, filed February 15, 1991 (USSN 656,773 and 835,698 
corresponding to Int'l application WO 92/1448); and, each of 
these applications, as well as each application, document or 
reference cited in these applications, is hereby incorporated 
herein by reference. Documents or references are also cited in 
the following text, either in a Reference List appended to 
certain Examples, or before the claims, or in the text itself; 
and, each of these documents or references is hereby expressly 
incorporated herein by reference. 
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FIELD OF THE INVENTION 

This invention relates to pneumococcal genes, portions 
thereof, expression products therefrom and uses of such genes, 
portions and products; especially to genes of Streptococcus 
pneumoniae, e.g,, the gene encoding pneumococcal surface protein 
A (PspA) (said gene being "pspA") , pspA-like genes, pneumococcal 
surface protein C (PspC) (said gene being "pspC") , portions of 
such genes, expression products therefrom, and the uses of such 
genes, portions thereof and expression products therefrom. Such 
uses include uses of the genes and portions thereof for obtaining 
expression products by recombinant techniques, as well as for 
detecting the presence of Streptococcus pneumoniae or strains 
thereof by detecting DNA thereof by hybridization or 
amplification (e.g*, PCR) and hybridization techniques (e.g., 
obtaining DNA-containing sample, contacting same with genes or 
fragment under PCR, amplification and/or hybridization 
conditions, and detecting presence of or isolating hybrid or 
amplified product) . The expression product uses include use in 
preparing antigenic, immunological or vaccine compositions, for 
eliciting antibodies, an immunological response (other than or 
additional to antibodies) or a protective response (including 
antibody or other immunological response by administering 
composition to a suitable host); or, the expression product can 
be for use in detecting the presence of streptococcus pneumoniae 
by detecting antibodies to Streptococcus pneumoniae protein (s) or 
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antibodies to a portion thereof in a host, e.g., by obtaining an 
antibody-containing sample from a relevant host, contacting the 
sample with expression product and detecting binding (for 
instance by having the product labeled) ; and, the antibodies 
generated by the aforementioned compositions are useful in 
diagnostic or detection kits or assays. Thus, the invention 
relates to varied compositions of matter and methods for use 
thereof . 

BACKGROUND OF THE INVENTION 

Streptococcus pneumoniae is an important cause of 
otitis media, meningitis, bacteremia and pneumonia. Despite the 
use of antibiotics and vaccines, the prevalence of pneumococcal 
infections has declined little over the last twenty-five years. 

It is generally accepted that immunity to Streptococcus 
pneumoniae can be mediated by specific antibodies against the 
polysaccharide capsule of the pneumococcus . However, neonates 
and young children fail to make an immune response against 
polysaccharide antigens and can have repeated infections 
involving the same capsular serotype. 

One approach to immunizing infants against a number of 
encapsulated bacteria is to conjugate the capsular polysaccharide 
antigens to protein to make them immunogenic. This approach has 
been successful, for example, with Haemophilus influenzae b (see 
U.S. Patent no. 4,496,538 to Gordon and U.S. Patent no. 4,673,574 
to Anderson) . However, there are over eighty known capsular 
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serotypes of S. pneumoniae of which twenty-three account for roost 
of the disease. For a pneumococcal polysaccharide-protein 
conjugate to be successful, the capsular types responsible for 
most pneumococcal infections would have to be made adequately 
immunogenic. This approach may be difficult, because the twenty- 
three polysaccharides included in the presently-available vaccine 
are not all adequately immunogenic, even in adults. 

An alternative approach for protecting children, and 
also the elderly, from pneumococcal infection would be to 
identify protein antigens that could elicit protective immune 
responses. Such proteins may serve as a vaccine by themselves, 
may be used in conjunction with successful polysaccharide-protein 
conjugates, or as carriers for polysaccharides, 

McDaniel et al. (I), J. Exp* Med. 160:386-397, 1984, 
relates to the production of hybridoma antibodies that recognize 
cell surface polypeptide (s) on S. pneumoniae and protection of 
mice from infection with certain strains of encapsulated 
pneumococci by such antibodies. This surface protein antigen has 
been termed "pneumococcal surface protein A" or PspA for short. 

McDaniel et al. (II), Microbial Pathogenesis 1:519-531, 
1986, relates to studies on the characterization of the PspA. 
Considerable diversity in the PspA molecule in different strains 
was found, as were differences in the epitopes recognized by 
different antibodies. 
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McDaniel et al. (Ill), J* Exp. Med* 165:381-394, 1987, 
relates to immunization of X-linked immunodef icient (XID) mice 
with non-encapsulated pneumococci expressing PspA, but not 
isogenic pneumococci lacking PspA, protects mice from subsequent 
fatal infection with pneumococci. 

McDaniel et al. (IV), Infect. Immun., 59:222-228, 1991, 
relates to immunization of mice with a recombinant full length 
fragment of PspA. that is able to elicit protection against 
pneumococcal strains of capsular types 6A and 3. 

Crain et al, Inf ect. Immun. , 56:3293-3299, 1990, relates 
to a rabbit antiserum that detects PspA in 100% (n = 95) of 
clinical and laboratory isolates of strains of S. pneumoniae. 
When reacted with seven monoclonal antibodies to PspA, fifty- 
seven S. pneumoniae isolates exhibited thirty-one different 
patterns of reactivity. 

The PspA protein type is independent of capsular type. 
It would seem that genetic mutation or exchange in the 
environment has allowed for the development of a large pool of 
strains which are highly diverse with respect to capsule, PspA, 
and possibly other molecules with variable structures. 
Variability of PspA's from different strains also is evident in 
their molecular weights, which range from 67 to 99 kD. The 
observed differences are stably inherited and are not the result 
of protein degradation. 
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Immunization with a partially purified PspA from a 
recombinant A gtli clone, elicited protection against challenge 
with several S. pneumoniae strains representing different 
capsular and PspA types, as described in McDaniel et al. (IV), 
Infect, immun. 59:222-228, 1991. Although clones expressing PspA 
were constructed according to that paper, the product was 
insoluble and isolation from cell fragments following lysis was 
not possible. 

While the protein is variable in structure between 
different pneumococcal strains, numerous cross-reactions exist 
between all PspA's, suggesting that sufficient common epitopes 
may be present to allow a single PspA or at least a small number 
of PspA's to elicit protection against a large number of S. 
pneumoniae strains. 

In addition to the published literature specifically 
referred to above, the inventors, in conjunction with co-workers, 
have published further details concerning PspA's, as follows: 

1. Abstracts of 89th Annual Meeting of the American 
Society for Microbiology, p. 125, item D-257, May 
1989; 1 

2. Abstracts of 90th Annual Meeting of the American 
Society for Microbiology, p. 98, item D-106, May 
1990; 1 

3. Abstracts of 3rd International ASM Conference on 
Streptococcal Genetics, p. n, item 12, June 1990; 

4. Talkington et al, Infect. Immun. 59:1285-1289. 
1991; 
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5* Yother et al (I), J. Bacteriol. 174:601-609, 1992; 
and 

6. Yother et al (II), J. Bacteriol. 174:610-618, 
1992. 

7. McDaniel et al (V) , Microbiol. Pathogenesis, 
13:261-268* 

It would be useful to provide PspA or fragments thereof 
in compositions, including PspA's or fragments from varying 
strains in such compositions, to provide antigenic, immunological 
or vaccine compositions; and, it is even further useful to show 
that the various strains can be grouped or typed, thereby 
providing a basis for cross-reactivities of PspA's or fragments 
thereof, and thus providing a means for determining which strains 
to represent in such compositions (as well as how to test for, 
detect or diagnose one strain from another) . 

Further, it would be advantageous to provide a pspA - 
like gene or a pspC gene in certain strains, as well as primers 
(oligonucleotides) for identification of such a gene, as well as 
of conserved regions in that gene and in pspA; for instance, for 
detecting, determining, isolating, or diagnosing strains of S. 
pneumonia. These uses and advantages, it is believed, have not 
heretofore been provided in the art. 
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OBJECTS AND SUMMARY OF THE INVENTION 

The invention provides an isolated amino acid molecule 
comprising residues 1 to 115, 1 to 260, 192 to 588, 192 to 299, 
or residues 192 to 2 60 of pneumococcal surface protein A of 
Streptococcus pneumoniae* 

The invention further provides an isolated DNA molecule 
comprising a fragment of a pneumococcal surface protein A gene of 
Streptococcus pneumoniae encoding the isolated amino acid 
molecule. 

The invention also provides PCR primers or 
hybridization probes comprising the isolated DNA molecule. 

The invention additionally provides an antigenic, 
vaccine or immunological composition comprising the amino acid 
molecule. 

The invention includes an isolated DNA molecule 
comprising nucleotides 1 to 26, 1967 to 1990, 161 to 187, 1093 to 
1117, or 1312 to 1331 or 1333 to 1355 of a pneumococcal surface 
protein A gene of Streptococcus pneumoniae. The DNA molecule can 
be used as a PCR primer or hybridization probe; and therefore the 
invention comprehends a PCR primer or hybridization probe 
comprising the isolated DNA molecule. 

The invention also includes an isolated DNA molecule 
comprising a fragment having homology with a portion of a 
pneumococcal surface protein A gene of Streptococcus pneumoniae. 
The DNA preferably is the following (which include the portion 
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having homology and restriction sites, and selection of other 
restriction sites or sequences for such DNA is within the ambit 
of the skilled artisan from this disclosure) : 

CCGGATCCAGCTCCTGCACCAAAAAC ; 
GCGCGTCGACGGCTTAAACCCATTCACCATTGG ; 
CCGGATCCTGAGCCAGAGCAGTTGGCTG ; 
CCGGATCCGCTCAAAGAGATTGATGAGTCTG ; 
GCGGATCCCGTAGCCAGTCAGTCTAAAGCTG ; 
CTGAGTCGACTGGAGTTTCTGGAGCTGGAGC ; 
CCGGATCCAGCTCCAGCTCCAGAAACTCCAG ; 
GCGGATCCTTGACCAATATTTACGGAGGAGGC ; 
GTTTTTGGTGCAGGAGCTGG ; 
GCTATGGGCTACAGGTTG ; 
CCACCTGTAGCCATAGC ; 

CCGCATCCAGCGTGCCTATCTTAGGGGCTGGTT ; and 
GCAAG CTTATG ATATAG AAATTTGTAAC 

(thus, the invention broadly comprehends DNA homologous to 
portions of pspA; preferably further including restriction 
sequences) . 

These DNA molecules can be used as PCR primers or 
probes; and thus, the invention comprehends a primer or probe 
comprising and of these molecules* 

The invention further still provides PCR probe (s) which 
distinguishes between pspA and pspA-like nucleotide sequence, as 
well as PCR probe (s) which hybridizes to both pspA and pspA-like 
nucleotide sequences. 

Additionally, the invention includes a PspA extract 
prepared by a process comprising: growing pneumococci in a first 
medium containing choline chloride, eluting live pneumococci with 
a choline chloride containing salt solution, and growing the 
pneumococci in a second medium containing an alkanolamine and 
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substantially no choline; as well as a PspA extract prepared by 
that process and further comprising purifying PspA by isolation 
on a choline-Sepharose affinity column. These processes are also 
included in the invention. 

An immunological composition comprising thses extracts 
is comprehended by the invention, as well as an immunological 
composition comprising the full length PspA- 

A method for enhancing the immunogenicity of a PspA- 
containing immunological composition comprising, in said 
composition, the C-terminal portion of PspA, is additionally 
comprehended, as well. 

An immunological composition comprising at least two 
PspAs. The latter immunological composition can have the PspAs 
from different groups or families; the groups or families can be 
based on RFLP or sequence studies (see, e.g., Fig. 13) ♦ 

Further, the invention provides an isolated amino acid 
molecule comprising pneumococcal surface protein C, PspC, of 
Streptococcus pneumoniae having an alpha-helical, proline rich 
and repeat regions, an isolated DNA molecule comprising a 
pneumcoccal surface protein C gene encoding the aforementioned 
PspC, and primers and hybrization probes consisting essentially 
of the isolated DNA molecule. 

Still further, an isolated amino acid molecule 
comprising pneumococcal surface protein C, PspC, of Streptococcus 
pneumoniae is provided, having an alpha-helical, proline rich and 
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repeat regions, having substantial homology with a protection 
eliciting region of PspA, and an isolated DNA molecule comprising 
a pneumcoccal surface protein C gene encoding the aforementioned 
PspC, and primers and hybrization probes consisting essentially 
of the isolated DNA molecule are provided by the present 
invention. 

Additionally, the present invention provides 
immunological compositions comprising PspC. 

These and other embodiments are disclosed or are 
obvious from the following detailed description. 
BRIEF DESCRIPTION OF THE FIGURES 

Figures 1A and IB show: Evaluation of digested olasmid 
constructs . Fig. 1A: 1% agarose gel electrophoresis of plasmids 
isolated from transformed E. coli BL21(DE3) strains stained with 
ethidium bromide. Lane 1: 1 kb DNA ladder (sizes noted in kb) , 
lane 2: pRCT125; lane 3: pRC105, lane 4: DBL5 pspA insert, lane 
5: pRCT113, lane 6: BG9739 pspA insert, lane 7: pRCT117, and lane 
8: L81905 pspft insert. Fig. IB: Corresponding Southern blot of 
gel in Fig. 1A probed with full-length Rxl pspA and hybridization 
detected as described in Example 1. The arrow indicates the 1.2 
kb pspA digested inserts from plasmid constructs and the PCR- 
amplified pspA fragments from the pneumococcal donor strains used 
in cloning. 

Figure 2 shows: Evaluation of strain RCT105 cell 
fractions cont aining truncated DBL5 PspA . Proteins from E. coli 
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cell fractions were resolved by 10% SDS-PAGE, transferred to NC, 
and probed with MAb XiR278. Lane 1: molecular weight markers 
(noted in kDa) , lane 2: full-length, native DBL5 PspA, lane 3: 
uninduced cells, lanes 4-6: induced cells; 1 hr, 2 hr, and 3 hr 
of IPTG induction respectively, lane 7: periplasmic proteins, 
lane 8: cytoplasmic proteins, and lane 9: insoluble cell 
wall /membrane material. 

Figure 3 shows: SDS-PAGE of R36A PspA (80 ng) column 
isolated from CDM-ET and an equal volume of an equivalent WG44.1 
prep* Identical gels are shown stained with Bio-Rad silver kit 
(A) or immunoblotted with PspA MAb XiR278(B). The PspA isolated 
from R36A shows the characteristic monomer (84 kDa) and dimer 
bands * 

Figure 4 shows: Cell lysates of pneumococcal isolates 
MC27 and MC28 were subjected to SDS-PAGE and transferred to 
nitrocellulose for Western blotting with seven MAb to PspA. 7D2 
detected a protein of 82 kDa in each isolate and XiR278 and 2A4 
detected a protein of 190 kDa in each isolate. MAb Xi64, Xil26, 
1A4 and SR4W4 were not reactive. Strains MC25 and MC26 yielded 
identical results. 

Figure 5 (Figs. 5A and 5B) shows: Southern blot of 
Hind III digest of MC25-MC28 chromosomal DNA developed at a 
stringency greater than 95 percent. A digest of Rxl DNA was used 
as a comparison. The blot was probed with LSMpspA13/2, a full 
length Rxl probe (Fig. 5) and LSMpspAl2/6 a 5' probe of Rxl pspA 
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(Fig. 5) . The same concentration of Rxl DNA was used in both 
panels, but the concentrations of MC25-MC28 DNA in Fig. 5B were 
half that used in Fig. 5A to avoid detection of partial digests. 

Figure 6 shows: RFLP of amplified pspA. PspA from MC25 
was amplified by PCR using 5' and 3' primers for pspA (LSM13 and 
LSM, respectively) . The amplified DNA was digested with 
individual restriction endonucleases prior to electrophoresis and 
staining with ethidium bromide. Lane 1 Bell, Lane 2 BAAfHI, Lane 
3 BstNI, Lane 4 PstI, Lane 5 Sad, Lane 6 EcoRI, Lane 7 Smal, 
Lane 8 KpnI. 

Figure 7 shows: A depiction of PspA showing the 
relative location and orientation of the oligonucleotides. 

Figure 8 shows: Derivatives of the S. pneumoniae D39- 
Rxi family. 

Figures 9 to 10 show: Electrophoresis of pspA or 
amplified pspA product with Hhal (Fig. 9) , Sau3AI (Fig. 10) . 

Figure 11 shows: RFLP pattern of two isolates from six 

families. 

Figure 12 shows: RFLP pattern of two isolates from six 
families (using products from amplification with SKH2 and LSM13) . 

Figure 13 shows: Sequence primarily in the N- terminal 
half of PspA. 

Figure 14 shows: Cell lysates of pneumococcal isolates 
MC27 and MC28, subjected to SDS-PAGE and Western blotting with 
seven MAbs to PspA; 7D2 detected a protein of 82 kDa in each 
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isolate, and Xi278 and 2A4 detected a protein of 190 kDa in each 
isolate; MAbs Xi64, Xil26, 1A4 and SR4W4 were not reactive; 
strains MC25 and MC26 yielded identical results (not shown) . 

Figure 15A and 15B show: a Southern blot of Hind III 
digest of MC25-28 chromosomal DNA, using a digest of Rxl DNA as a 
comparison; the blot was probed with LSMpspA13/2 , a full length 
Rxl probe (A), and LSMpspA12/6, a 5' probe of Rxl pspA (B) ; the 
same concentration of Rxl DNA was used in both panels, but the 
concentrations of MC25-28 DNA in B were half that used in A to 
avoid detection of partial digests. 

Figures 15C and 15D show: the nucleotide sequences of 
primers LSM13 , LSM2 , LSM12 and LSM6, and that of probes 
LSMpspA13/2 and LSMpspA12/6. 

Figure 16 shows: RFLP of amplified pspA, wherein PspA 
from MC25 was amplified by PCR using 5' and 3' primers for pspA 
(LSM13 and LSM2, respectively); the amplified DNA was digested 
with individual restriction endonucleases prior to 
electrophoresis and staining with ethidium bromide; Bel I was 
used in lane 1; BamH I was used in lane 2; BstN I was used in 
lane 3; Pst I was used in lane 4; Sac I was used in lane 5; EcoR 
I was used in lane 6; Sma I was used in lane 7; and Kpn I was 
used in lane 8* 

Figure 17 shows: position and orientation of 
oligonucleotides relative to domains encoded by pspA; numbers 
along the bottom of the Figure represent amino acids in the 
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0 

mature PspA polypeptide from strain Rxl f and arrows represent the 
relative position (not to scale) and orientation of 
oligonucleotides . 

Figure 18 shows; a restriction map of the pZero vector. 

Figure 19 shows: the nucleotide sequences of SKH2, 
LSM13 , N192 and C588. 

Figure 20 shows: a comparison of the structural motifs 
of PspA and PspC; PspA has a smaller alpha-helical region, and 
does not contain the direct repeats within the alpha-helix 
(indicated by the dashed lines); the alpha-helical regions which 
are homologous between PspA and PspC are indicated by the dashed 
lines) ; the alpha-helical regions which are homologous between 
PspA and PspC are indicated by the striped pattern; and PCR 
primers are indicated by the arrows ♦ 

Figure 21 shows: the amino acid and nucleotide 
sequence of PspC, wherein the putative -10 and -35 regions are 
underlined, and the ribosomal binding site is in lower case. 

Figure 22 shows: the Bestfit analysis of PspA and 
PspC; percent identity is 69% and percent similarity is 77%; 
amino acids of PspA are one the bottom line (1-588) and amino 
acids of PspC are on the top line (249-891) , and a dashed line 
indicated identity. 

Figure 23 shows: the coiled coil motif of the alpha- 
helix of PspC; amino acids that are not in the coiled coil motif 
are in the right column. 
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Figure 24 shows: a matrix plot comparison of the 
repeat regions of the alpha-helical region of PspC. 

Figure 25 shows: the sequence of the alpha helical and 
proline regions of LXS532 (PspC.D39). 

Figure 26 shows: a comparison of nucleotides of 
pspA. Rxl to pspC. D39 - 

Figure 27 shows: a BESTFIT analysis of pspC. EF6797 and 

pspC.D39 • 

Figure 28 shows: the amino acid comparison of PspC of 
EF6797 and D39. 

Figure 29 shows: the amino acid comparison of PspC.D39 
and PspA.Rxl. 
DETAILED DESCRIPTION 

Knowledge of and familiarity with the applications 
incorporated herein by reference is assumed; and, those 
applications disclose the sequence of pspA as well as certain 
portions thereof, and PspA and compositions containing PspA. 

As discussed above and in the following Examples, the 
invention relates to truncated PspA, e.g., PspA C-terminal to 
position 192 such as a. a. 192-588 ("BCIOO") 192-299 and 192-260 
of PspA eliciting cross-protection, as well as to DNA encoding 
such truncated PspA (which amplify the coding for these amino 
acid regions homologous to most PspAs) . 

The invention further relates to a pspA-like gene, or a 
pspC gene and portions thereof (e.g., probes, primers) which can 
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hybridize thereto and/or amplify that gene, as well as to DNA 
molecules which hybridize to pspA, so that one can, by 
hybridization assay and/or amplification, ascertain the presence 
of a particular pneumococcal strain; and, the invention provides 
that a PspC can be produced by the pspA-like or pspC sequence 
(which PspC can be used like PspA) . 

Indeed, the invention further relates to 
oligonucleotide probes and /or primers which react with pspA 
and/or pspC of many, if not all, strains, so as to permit 
identification, detection or diagnosis of any pneumococcal 
strain, as well as to expression products of such probes and/or 
primers, which can provide cross-reactive epitopes of interest. 

The repeat region of pspA and/or pspC is highly 
conserved such that the present invention provides 
oligonucleotide probes or primers to this region reactive with 
most, if not all strains, thereby providing diagnostic assays and 
a means for identifying epitopes of interest. 

The invention demonstrates that the pspC gene is 
homologous to the pspA gene in the leader sequence, first portion 
of the proline-rich region and in the repeat region; but, these 
genes differ in the second portion of their proline-rich regions 
and at the very 3' end of the gene encoding the 17 amino acid 
tail of PspA. The product of the pspC gene is expected to lack a 
C-terminal tail, suggesting different anchoring than PspA. Drug 
interference with functions such as surface binding of the coding 
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for repeat regions of pspA and the pspC genes, or with the repeat 
regions of the expression products, is therefore a target for 
intervention of pneumococcal infection. 

Further still, the invention provides evidence of 
additional pspA homologous sequences, in addition to pspA and the 
pspC sequence. The invention, as mentioned above, includes 
oligonucleotide probes or primers which distinguish between pspA 
and the pspC sequence, e.g., LSMl and LSM2, useful for diagnostic 
detecting, or isolating purposes; and LSM1 and LSM10 or LSM1 and 
LSM7 which amplify a portion of the pspc gene, particularly the 
portion of that gene which encodes an antigenic, immunological or 
protective protein. 

The invention further relates to a method for the 
isolation of native PspA by growth of pneumococci medium 
containing high concentrations of (about 0.9% to about 1.4%, 
preferably 1.2%) choline chloride, elation of live pneumococci 
with a salt solution containing choline chloride, e.g., about 1% 
about 3%, preferably 2% choline chloride, and growth of 
pneumococci in medium in which the choline in the medium has been 
almost or substantially completely replaced with a lower 
alkanolamine, e.g., C x -C 6 , preferably C 2 alkanolamine, i.e., 
preferably C 2 alkanolamine, i.e., preferably ethanolamine (e.g., 
0.0000005% to 0.0000015%, preferably 0.000001% choline chloride 
plus 0.02% to 0.04% alkanolamine (ethanolamine), preferably 
0.03%). PspA from such pneumococci is then preferably isolated 
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from a choline-sepharose affinity column, thereby providing 
highly purified PspA. Such isolated and/or purified PspA is 
highly immunogenic and is useful in antigenic, immunological or 
vaccine composition. 

Indeed, the growth media of the pneumococci grown in 
the presence of the alkanolamine (rather than choline) contains 
PspA and is itself highly immunogenic and therefore useful as an 
antigenic, immunological or vaccine composition; and, is rather 
inexpensive to produce. Per microgram of PspA, the PspA in the 
alkanolamine medium is much more protective than PspA isolated by 
other means, e.g., from extracts. Perhaps, without wishing to 
necessarily be bound by any one particular theory, there is a 
synergistic effect upon PspA by the other components present 
prior to isolation, or simply PspA is more protective (more 
antigenic) prior to isolation and/or purification (implying a 
possibility of some loss of activity from the step of isolation 
and/or purification) . 

The invention further relates to the N-terminal 115 
amino acids of PspA, which is useful for compositions comprising 
an epitope of interest, immunological or vaccine compositions, as 
well as the DNA coding therefor, which is useful in preparing 
these N-terminal amino acids by recombination, or for use as 
probes and/or primers for hybridization and/or amplification for 
identification, detection or diagnosis purposes. 
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The invention further demonstrates that there is a 
grouping among the pspA RFLP families . This provides a method of 
identifying families of different PspAs based on RFLP pattern of 
pspAs, as well as a means for obtaining diversity of PspAs in an 
antigenic, immunological or vaccine composition; and, a method of 
characterizing clonotypes of PspA based on RFLP patterns of PspA. 
And, the invention thus provides oligonucleotides which permit 
amplification of most, e.g., a majority, if not all of S. 
pneumoniae and thereby permit RFLP analysis of a majority, if not 
all, S. pneumoniae. 

The invention also provides PspC, having an approximate 
molecular weight of 105 kD, with an estimated pi of 6.09, and 
comprising an alpha-helical region, followed by a proline-rich 
domain and repeat region. A major cross-protective region of 
PspA comprises the C-terminal third of the alpha-helical region 
(between residues 192 and 260 of PspA) , which region accounts for 
the binding of 4 of 5 cross-protective MAb, and PspA fragments 
comprising this region can elicit cross-protective immunity in 
mice. Homology between PspC and PspA begins at amino acid 148 of 
PspA, thus including the region from 192 to 299, and including 
the entire PspC sequence C-terminal of amino acid 486. Due to 
the substantial sequence homology between PspA and PspC in a 
region comprising the epitopes of interest, known to be 
protection eliciting, PspC is likely to comprise epitopes of 
interest similar to those found in PspA, Antibodies specific for 
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this region of PspA, i.e., between amino acids 148 and 299, 
should cross-react with PspC, and thus afford protection by 
reacting with PspC and PspA. Similarly, immunization with PspC 
would be expected to elicit antibodies cross-protective against 
PspA. 

An epitope of interest is an antigen or immunogen or 
immunologically active fragment thereof from a pathogen or toxin 
of veterinary or human interest* 

The present invention provides an immunogenic, 
immunological or vaccine composition containing the pneumococcal 
epitope of interest, and a pharmaceutical^ acceptable carrier or 
diluent. An immunological composition containing the 
pneumococcal epitope of interest, elicits an immunological 
response - local or systemic. The response can, but need not be, 
protective. Am immunogenic composition containing the 
pneumococcal epitope of interest, likewise elicits a local or 
systemic immunological response which can, but need not be, 
protective. A vaccine composition elicits a local or systemic 
protective response. Accordingly, the terms "immunological 
composition" and "immunogenic composition" include a "vaccine 
composition" (as the two former terms can be protective 
compositions) * 

The invention therefore also provides a method of 
inducing an immunological response in a host mammal comprising 
administering to the host an immunogenic, immunological or 
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vaccine composition comprising the pneumococcal epitope of 
interest, and a pharmaceutical ly acceptable carrier or diluent. 

The DNA encoding the pneumococcal epitope of interest 
can be DNA which codes for full length PspA, PspC, or fragments 
thereof. A sequence which codes for a fragment of PspA or PspC 
can encode that portion of PspA or PspC which contains an epitope 
of interest, such as a protection-eliciting epitope of the 
protein. 

Regions of PspA and PspC have been identified from the 
Rxl strain of S. pneumoniae which not only contain protection- 
eliciting epitopes, but are also sufficiently cross-reactive with 
other PspAs from other S. pneumoniae strains so as to be suitable 
candidates for the region of PspA to be incorporated into a 
vaccine, immunological or immunogenic composition. Epitopic 
regions of PspA include residues 1 to 115, 1 to 314, 192 to 260 
and 192 to 588. DNA encoding fragments of PspA can comprise DNA 
which codes for the aforementioned epitopic regions of PspA; or 
it can comprise DNA encoding overlapping fragments of PspA, e.g., 
fragment 192 to 588 includes 192 to 260, and fragment l to 314 
includes 1 to 115 and 192 to 260. 

As to epitopes of interest, one skilled in the art can 
determine an epitope of immunodominant region of a peptide or 
polypeptide and ergo the coding DNA therefor from the knowledge 
of the amino acid and corresponding DNA sequences of the peptide 
or polypeptide, as well as from the nature of particular amino 
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acids (e.g., size, charge, etc.) and the codon dictionary, 
without undue experimentation. 

A general method for determining which portions of a 
protein to use in an immunological composition focuses on the 
size and sequence of the antigen of interest. "In general, large 
proteins, because they have more potential determinants are 
better antigens than small ones. The more foreign an antigen, 
that is the less similar to self configurations which induce 
tolerance, the more effective it is in provoking an immune 
response." Ivan Roitt, Essential Immunology . 1988. 

As to size, the skilled artisan can maximize the size 
of the protein encoded by the DNA sequence to be inserted into 
the viral vector (keeping in mind the packaging limitations of 
the vector) . To minimize the DNA inserted while maximizing the 
size of the protein expressed, the DNA sequence can exclude 
introns (regions of a gene which are transcribed but which are 
subsequently excised from the primary RNA transcript) . 

At a minimum, the DNA sequence can code for a peptide 
at least 8 or 9 amino acids long. This is the minimum length 
that a peptide needs to be in order to stimulate a CD4+ T cell 
response (which recognizes virus infected cells or cancerous 
cells) . A minimum peptide length of 13 to 25 amino acids is 
useful to stimulate a CD8+ T cell response (which recognizes 
special antigen presenting cells which have engulfed the 
pathogen). See Kendrew, supra. However, as these are minimum 
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lengths, these peptides are likely to generate an immunological 
response, i.e., an antibody or T cell response; but, for a 
protective response (as from a vaccine composition) , a longer 
peptide is preferred. 

With respect to the sequence, the DNA sequence 
preferably encodes at least regions of the peptide that generate 
an antibody response or a T cell response. One method to 
determine T and B cell epitopes involves epitope mapping. The 
protein of interest "is fragmented into overlapping peptides with 
proteolytic enzymes. The individual peptides are then tested for 
their ability to bind to an antibody elicited by the native 
protein or to induce T cell or B cell activation. This approach 
has been particularly useful in mapping T-cell epitopes since the 
T cell recognizes short linear peptides complexed with MHC 
molecules. The method is less effective for determining B-cell 
epitopes" since B cell epitopes are often not linear amino acid 
sequence but rather result from the tertiary structure of the 
folded three dimensional protein. Janis Kuby, Immunology , (1992) 
pp. 79-80. 

Another method for determining an epitope of interest 
is to choose the regions of the protein that are hydrophilic. 
Hydrophilic residues are often on the surface of the protein and 
therefore often the regions of the protein which are accessible 
to the antibody. Janis Kuby, Immunology , (1992) P. 81. 
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Yet another method for determining an epitope of 
interest is to perform an X-ray cyrstallographic analysis of the 
antigen (full length) -antibody complex. Janis Kuby, Immunology , 

(1992) p. 80- 

Still another method for choosing an epitope of 
interest which can generate a T cell response is to identify from 
the protein sequence potential HLA anchor binding motifs which 
are peptide sequences which are known to be likely to bind to the 
MHC molecule. 

The peptide which is a putative epitope, to generate a 
T cell response, should be presented in a MHC complex ♦ The 
peptide preferably contains appropriate anchor motifs for binding 
to the MHC molecules, and should bind with high enough affinity 
to generate an immune response. Factors which can be considered 
are: the HLA type of the patient (vertebrate, animal or human) 
expected to be immunized, the sequence of the protein, the 
presence of appropriate anchor motifs and the occurance of the 
peptide sequence in other vital cells. 

An immune response is generated, in general, as 
follows: T cells recognize proteins only when the protein has 
been cleaved into smaller peptides and is presented in a complex 
called the "major histocompatability complex MHC" located on 
another cell's surface* There are two classes of MHC complexes - 
class I and class II, and each class is made up of many different 
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alleles* Different patients have different types of MHC complex 
alleles; they are said to have a 'different HLA type'. 

Class I MHC complexes are found on virtually every cell 
and present peptides from proteins produced inside the cell. 
Thus, Class I HHC complexes are useful for killing cells which 
when infected by viruses or which have become cancerous and as 
the result of expression of an oncogene. T cells which have a 
protein called CD4 on their surface, bind to the MHC class I 
cells and secrete lymphokines. The lymphokines stimulate a 
response; cells arrive and kill the viral infected cell* 

Class II MHC complexes are found only on antigen- 
presenting cells and are used to present peptides from 
circulating pathogens which have been endocytosed by the antigen- 
presenting cells. T cells which have a protein called CD8 bind 
to the MHC class II cells and kill the cell by exocytosis of 
lytic granules. 

Some guidelines in determining whether a protein is an 
epitopes of interest which will stimulate a T cell response, 
include: Peptide length - the peptide should be at least 8 or 9 
amino acids long to fit into the MHC class I complex and at least 
13-25 amino acids long to fit into a class II MHC complex. This 
length is a minimum for the peptide to bind to the MHC complex. 
It is preferred for the peptides to be longer than these lengths 
because cells may cut the expressed peptides. The peptide should 
contain an appropriate anchor motif which will enable it to bind 
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to the various class I or class II molecules with high enough 
specificity to generate an immune response (See Bocchia, M» et 
al, Specific Binding of Leukemia Oncogene Fusion Protein Peptides 
to HLA Class I Molecules , Blood 85:2680-2684; Englehard, VH, 
Structure of peptides associated with class I and class II MHC 
molecules Ann. Rev* Immunol. 12:181 (1994))* This can be done, 
without undue experimentation, by comparing the sequence of the 
protein of interest with published structures of peptides 
associated with the MHC molecules. Protein epitopes recognized 
by T cell receptors are peptides generated by enzymatic 
degradation of the protein molecule and are prestnted on the cell 
surface in association with class I or class II MHC molecules* 

Further, the skilled artisan can ascertain an epitope 
of interest by comparing the protein sequence with sequences 
listed in the protein data base. Regions of the protein which 
share little or no homology are better choices for being an 
epitope of that protein and are therefore useful in a vaccine or 
immunological composition. Regions which share great homology 
with widely found sequences present in vital cells should be 
avoided . 

Even further, another method is simply to generate or 
express portions of a protein of interest, generate monoclonal 
antibodies to those portions of the protein of interest, and then 
ascertain whether those antibodies inhibit growth in vitro of the 
pathogen from which the from which the protein was derived. The 
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skilled artisan can use the other guidelines set forth in this 
disclosure and in the art for generating or expressing portions 
of a protein of interest for analysis as to whether antibodies 
thereto inhibit growth in vitro. For example, the skilled 
artisan can generate portions of a protein of interest by: 
selecting 8 to 9 or 13 to 25 amino acid length portions of the 
protein , selecting hydrophilic regions, selecting portions shown 
to bind from X-ray data of the antigen (full length) -antibody 
complex, selecting regions which differ in seguence from other 
proteins, selecting potential HLA anchor binding motifs, or any 
combination of these methods or other methods known in the art. 

Epitopes recognized by antibodies are expressed on the 
surface of a protein. To determine the regions of a protein most 
likely to stimulate an antibody response one skilled in the art 
can preferably perform an epitope map, using the general methods 
described above, or other mapping methods known in the art* 

As can be seen from the foregoing, without undue 
experimentation, from this disclosure and the knowledge in the 
art, the skilled artisan can ascertain the amino acid and 
corresponding DNA sequence of an epitope of interest for 
obtaining a T cell, B cell and/or antibody response. In 
addition, reference is made to Gefter et al„, U.S. Patent No. 
5,019,384, issued May 28, 1991, and the documents it cites, 
incorporated herein by reference (Note especially the "Relevant 
Literature" section of this patent, and column 13 of this patent 
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which discloses that: "A large number of epitopes have been 
defined for a wide variety of organisms of interest. Of 
particular interest are those epitopes to which neutralizing 
antibodies are directed. Disclosures of such epitopes are in 
many of the references cited in the Relevant Literature 
section. ") 

Further, the invention demonstrates that more than one 
serologically complementary PspA molecule can be in an antigenic , 
immunological or vaccine composition, so as to elicit better 
response, e.g., protection, for instance, against a variety of 
strains of pneumococci; and, the invention provides a system of 
selecting PspAs for a multivalent composition which includes 
cross-protection evaluation so as to provide a maximally 
efficacious composition. 

The determination of the amount of antigen, e.g., PspA 
or truncated portion thereof and optional adjuvant in the 
inventive compositions and the preparation of those compositions 
can be in accordance with standard techniques well known to those 
skilled in the pharmaceutical or veterinary arts. In particular, 
the amount of antigen and adjuvant in the inventive compositions 
and the dosages administered are determined by techniques well 
known to those skilled in the medical or veterinary arts taking 
into consideration such factors as the particular antigen, the 
adjuvant (if present) , the age, sex, weight, species and 
condition of the particular patient, and the route of 
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administration. For instance, dosages of particular PspA 
antigens for suitable hosts in which an immunological response is 
desired, can be readily ascertained by those skilled in the art 
from this disclosure (see, e.g., the Examples), as is the amount 
of any adjuvant typically administered therewith. Thus, the 
skilled artisan can readily determine the amount of antigen and 
optional adjuvant in compositions and to be administered in 
methods of the invention. Typically, an adjuvant is commonly 
used as 0.001 to 50 wt% solution in phosphate buffered saline, 
and the antigen is present on the order of micrograms to 
milligrams, such as about 0.0001 to about 5 wt%, preferably about 
0.0001 to about 1 wt%, most preferably about 0.0001 to about 0.05 
wt% (see, e.g., Examples below or in applications cited herein). 

Typically, however, the antigen is present in an amount 
on the order of micrograms to milligrams, or, about 0.001 to 
about 20 wt%, preferably about 0.01 to about 10 wt%, and most 
preferably about 0.05 to about 5 wt% (see, e.g., Examples below). 

Of course, for any composition to be administered to an 
animal or human, including the components thereof, and for any 
particular method of administration, it is preferred to determine 
therefor: toxicity, such as by determining the lethal dose (LD) 
and LD 50 in a suitable animal model e.g., rodent such as mouse; 
and, the dosage of the composition (s) , concentration of 
components therein and timing of administering the 
composition (s) , which elicit a suitable immunological response, 
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such as by titrations of sera and analysis thereof for antibodies 
or antigens, e.g., by ELISA and/or RFFIT analysis. Such 
determinations do not require undue experimentation from the 
knowledge of the skilled artisan, this disclosure and the 
documents cited herein. And, the time for sequential 
administrations can be ascertained without undue experimentation. 

Examples of compositions of the invention include 
liquid preparations for orifice, e.g., oral, nasal, anal, 
vaginal, peroral, intragastric, mucosal (e.g., perlingual, 
alveolar, gingival, olfactory or respiratory mucosa) etc., 
administration such as suspensions, syrups or elixirs; and, 
preparations for parenteral, subcutaneous, intradermal, 
intramuscular or intravenous administration {e.g., injectable 
administration) , such as sterile suspensions or emulsions. Such 
compositions may be in admixture with a suitable carrier, 
diluent, or excipient such as sterile water, physiological 
saline, glucose or the like. The compositions can also be 
lyophilized. The compositions can contain auxiliary substances 
such as wetting or emulsifying agents, pH buffering agents, 
gelling or viscosity enhancing additives, preservatives, 
flavoring agents, colors, and the like, depending upon the route 
of administration and the preparation desired. Standard texts, 
such as "REMINGTON'S PHARMACEUTICAL SCIENCE", 17th edition, 1985, 
incorporated herein by reference, may be consulted to prepare 
suitable preparations, without undue experimentation. 
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Compositions of the invention, are conveniently 
provided as liquid preparations, e.g., isotonic aqueous 
solutions, suspensions, emulsions or viscous compositions which 
may be buffered to a selected pH. If digestive tract absorption 
is preferred, compositions of the invention can be in the "solid" 
form of pills, tablets, capsules, caplets and the like, including 
"solid" preparations which are time-released or which have a 
liquid filling, e.g., gelatin covered liquid, whereby the gelatin 
is dissolved in the stomach for delivery to the gut. If nasal or 
respiratory (mucosal) administration is desired, compositions may 
be in a form and dispensed by a squeeze spray dispenser, pump 
dispenser or aerosol dispenser. Aerosols are usually under 
pressure by means of a hydrocarbon. Pump dispensers can 
preferably dispense a metered dose or, a dose having a particular 
particle size. 

Compositions of the invention can contain 
pharmaceutically acceptable flavors and/or colors for rendering 
them more appealing, especially if they are administered orally. 
The viscous compositions may be in the form of gels, lotions, 
ointments, creams and the like and will typically contain a 
sufficient amount of a thickening agent so that the viscosity is 
from about 2500 to 6500 cps, although more viscous compositions, 
even up to 10,000 cps may be employed- Viscous compositions have 
a viscosity preferably of 2 500 to 5000 cps, since above that 
range they become more difficult to administer. However, above 
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that range, the compositions can approach solid or gelatin forms 
which are then easily administered as a swallowed pill for oral 
ingestion. 

Liquid preparations are normally easier to prepare than 
gels, other viscous compositions, and solid compositions. 
Additionally, liquid compositions are somewhat more convenient to 
administer, especially by injection or orally, to animals, 
children, particularly small children, and others who may have 
difficulty swallowing a pill, tablet, capsule or the like, or in 
multi-dose situations. Viscous compositions, on the other hand, 
can be formulated within the appropriate viscosity range to 
provide longer contact periods with mucosa, such as the lining of 
the stomach or nasal mucosa. 

Obviously, the choice of suitable carriers and other 
additives will depend on the exact route of administration and 
the nature of the particular dosage form, e.g., liquid dosage 
form [e.g., whether the composition is to be formulated into a 
solution, a suspension, gel or another liquid form], or solid 
dosage form [e.g. , whether the composition is to be formulated 
into a pill, tablet, capsule, caplet, time release form or 
liquid-filled form] . 

Solutions, suspensions and gels, normally contain a 
major amount of water (preferably purified water) in addition to 
the antigen, lipoprotein and optional adjuvant. Minor amounts of 
other ingredients such as pH adjusters (e.g., a base such as 
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NaOH) , emulsifiers or dispersing agents, buffering agents, 
preservatives, wetting agents, jelling agents, (e.g., 
methylcellulose) , colors and/or flavors may also be present. The 
compositions can be isotonic, i.e., it can have the same osmotic 
pressure as blood and lacrimal fluid. 

The desired isotonicity of the compositions of this 
invention may be accomplished using sodium chloride, or other 
pharmaceutical^ acceptable agents such as dextrose, boric acid, 
sodium tartrate, propylene glycol or other inorganic or organic 
solutes. Sodium chloride is preferred particularly for buffers 
containing sodium ions. 

Viscosity of the compositions may be maintained at the 
selected level using a pharmaceutical^ acceptable thickening 
agent. Methylcellulose is preferred because it is readily and 
economically available and is easy to work with. Other suitable 
thickening agents include, for example, xanthan gum, 
carboxymethyl cellulose, hydroxypropyl cellulose, carbomer, and 
the like. The preferred concentration of the thickener will 
depend upon the agent selected. The important point is to use an 
amount which will achieve the selected viscosity. Viscous 
compositions are normally prepared from solutions by the addition 
of such thickening agents. 

A pharmaceutical ly acceptable preservative can be 
employed to increase the shelf -life of the compositions. Benzyl 
alcohol may be suitable, although a variety of preservatives 
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including, for example, parabens, thimerosal, chlorobutanol, or 
benzalkonium chloride may also be employed. A suitable 
concentration of the preservative will be from 0.02% to 2% based 
on the total weight although there may be appreciable variation 
depending upon the agent selected* 

Those skilled in the art will recognize that the 
components of the compositions must be selected to be chemically 
inert with respect to the PspA antigen and optional adjuvant. 
This will present no problem to those skilled in chemical and 
pharmaceutical principles, or problems can be readily avoided by 
reference to standard texts or by simple experiments (not 
involving undue experimentation) , from this disclosure and the 
documents cited herein. 

The immunologically effective compositions of this 
invention are prepared by mixing the ingredients following 
generally accepted procedures. For example the selected 
components may be simply mixed in a blender, or other standard 
device to produce a concentrated mixture which may then be 
adjusted to the final concentration and viscosity by the addition 
of water or thickening agent and possibly a buffer to control pH 
or an additional solute to control tonicity. Generally the pH 
may be from about 3 to 7.5. Compositions can be administered in 
dosages and by techniques well known to those skilled in the 
medical and veterinary arts taking into consideration such 
factors as the age, sex, weight, and condition of the particular 
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patient or animal, and the composition form used for 
administration (e.g., solid vs. liquid). Dosages for humans or 
other mammals can be determined without undue experimentation by 
the skilled artisan, from this disclosure, the documents cited 
herein, the Examples below (e.g., from the Examples involving 
mice) . 

Suitable regimes for initial administration and booster 
doses or for sequential administrations also are variable, may 
include an initial administration followed by subsequent 
administrations; but nonetheless, may be ascertained by the 
skilled artisan, from this disclosure, the documents cited 
herein, and the Examples below. 

PGR techniques for amplifying sample DNA for diagnostic 
detection or assay methods are known from the art cited herein 
and the documents cited herein (see Examples) , as are 
hybridization techniques for such methods. And, without undue 
experimentation, the skilled artisan can use gene products and 
antibodies therefrom in diagnostic, detection or assay methods by 
procedures known in the art. 

The following Examples are provided for illustration 
and are not to be considered a limitation of the invention. 
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EXAMPLES 

EXAMPLE 1 - Truncated Streptococcus pneumoniae PspA Molecules 

Elicit Cross-Protective Immunity Against 
Pneumococcal Challenge 

Since the isolation of S. pneumoniae from human saliva 
in 1881 and its subsequent connection with lobar pneumonia two 
years later, human disease resulting from pneumococcal infection 
has been associated with a significant degree of morbidity and 
mortality. A recent survey of urgently needed vaccines in the 
developing and developed world places an improved pneumococcal 
vaccine among the top three vaccine priorities of industrialized 
countries. The currently licensed vaccine is a 23-valent 
composition of pneumococcal capsular polysaccharides that is only 
about 60% effective in the elderly and due to poor efficacy is 
not recommended for use in children below two years of age. 
Furthermore the growing frequency of multi-drug resistant strains 
of S. pneumoniae being isolated accentuates the need for a more 
effective vaccine to prevent pneumococcal infections. 

The immunogenic nature of proteins makes them prime 
targets for new vaccine strategies. Pneumococcal molecules being 
investigated as potential protein vaccine candidates include 
pneumolysis, neuraminidase, autolysin and PspA. All of these 
proteins are capable of eliciting immunity in mice resulting in 
extension of life and protection against death with challenge 
doses near the LD 50 . PspA is unique among these macromolecules 
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in that is can elicit antibodies in animals that protect against 
inoculums 100-fold greater than the LD 50 . 

PspA is a surface-exposed protein with an apparent 
molecular weight of 67-99 kDa that is expressed by all clinically 
relevant S. pneumoniae strains examined to date* Though PspAs 
from different pneumococcal strains are serologically variable, 
many PspA antibodies exhibit cross-reactivities with PspAs from 
unrelated strains. Upon active immunization with PspA, mice 
generate PspA antibodies that protect against subsequent 
challenge with diverse strains of S. pneumoniae. The immunogenic 
and protection-eliciting properties of PspA suggest that it may 
be a good candidate molecule for a protein-based pneumococcal 
vaccine . 

Four distinct domains of PspA have been identified 
based on DNA sequence. They include a N-terminal highly charged 
alpha -helical region, a proline-rich 82 amino acid stretch, a C- 
terminal repeat segment comprised of ten 20-amino acid repeat 
sequences, and a 17-amino acid tail. A panel of MAbs to Rxl PspA 
have been produced and the binding sites of nine of these Mabs 
were recently localized within the Rxl pspA sequence in the 
alpha-helical region. Five of the Rxl Mabs were protective in 
mice infected with a virulent pneumococcal strain, WU2. Four of 
these five protective antibodies were mapped to the distal third 
(amino acids 192-260) of the alpha-helical domain of Rxl PspA. 
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Truncated PspAs containing amino acids 192-588 or 192- 
299, from pneumococcal strain Rxl were cloned and the recombinant 
proteins expressed and evaluated for their ability to elicit 
protection against subseguent challenge with S. pneumoniae WU2 • 
As with full-length Rxl PspA, both truncated PspAs containing the 
distal alpha-helical region protected mice against fatal WU2 
pneumococcal infection. However, the recombinant PspA fragment 
extending from amino acid 192 to 588 was more immunogenic than 
the smaller fragment, probably due to its larger size. In 
addition, the protection elicited by the amino acid fragment 192- 
588 of Rxl was comparable to that elicited by full-length Rxl 
PspA. Therefore, cross-protective epitopes of other PspAs were 
also sought in the C-terminal two-thirds of the molecule. As 
discussed below, PspAs homologous to amino acids 192-588 of 
strain Rxl were amplified by PCR, cloned, and expressed in E. 
coli* Then three recombinant PspAs, from capsule type 4 and 5 
strains, were evaluated for their ability to confer cross- 
protection against challenge strains of variant capsular types. 
The data demonstrate that the truncated PspAs from capsular type 
4 and 5 strains collectively protect against or early death 
caused by challenge with capsular type 4 and 5 parental strains 
as well as type 3, 6A, and 6B S. pneumoniae. 

Bacterial strains and culture conditions . All 
pneumococci were from the culture collection of this laboratory, 
and have been described (Yother, J. et al., Infect. Immun. 1982; 
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36 : 184-188; Briles, D.E., et al., Infect. Immun. 1992; 60: 111- 
116; McDaniel, L.S., et al., Microbe Pathog. 1992; 13: 261-269; 
and McDaniel, L.S, et al M In: Ferretti, J.J. et al., eds. 
Genetics of streptococcic enterococci. and lactococci . 1995; 283- 
286), with the exception of clinical isolates TJ0893 f 0922134 and 
BG8740. Pneumococcal strains TJ0893 and 0922134 were recovered 
from the blood of a 43-year old male and an elderly female, 
respectively. S* pneumoniae BG8743 is a blood isolate from an 8- 
month old infant. Strains employed in this study included 
capsular type 3 (A66.3, EF10197, WU2) , type 4 (BG9739, EF3296, 
EF5668, L81905), type 5 (DBL5) , type 6A (DBL6A, EF6796) , type 6B 
(BG7322, BG9163, DBL1) , type 14 TJOB93), type 19 (BG8090) , and 
type 23 (0922134, BG8743). In addition, strain WG44.1, which 
expresses no detectable PspA f was employed in PspA-specif ic 
antibody analysis. All chemicals were purchased from Fisher 
Scientific, Fair Lawn, New Jersey unless indicate otherwise. 

5. pneumoniae were grown in Todd Hewitt broth (Difco, 
Detroit, Michigan) supplemented with 5% yeast extract (Difco) . 
Mid-exponential phase cultures were used for seeding inocula in 
Lactated Ringer's (Abbott laboratories, North Chicago, Illinois) 
for challenge studies. For pneumococcal strains used in 
challenge studies, inocula ranged from 2.8 to 3.8 log 10 CFU 
(verified by dilution plating on blood agar) . Plates were 
incubated overnight in a candle jar at 37°c. 
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coli DH1 and BL21(DE3) were cultured in LB medium 
(1% Bacto-tryptone (Difco) , 0.5% Bacto Yeast (Difco) , 0.5% NaCl, 
0.1% dextrose). For the preparation of cell lysates, recombinant 
E. coil were grown in minimal E medium supplemented with 0.05 M 
thiamine, 0.2% glucose, 0.1% casamino acids (Difco), and 50 mg/ml 
kanamycin. Permanent bacterial stocks were stored at -80°C in 
growth medium containing 10% glycerol. 

Construction of plasmid-based strains . pET-9a 
(Novagen, Madison, Wisconsin) was used for cloning truncated pspA 
genes from fourteen S. pneumoniae strains: DBL5, DBL6A, WU2, 
BG9739, EF5668, L81905, 0922134, BG8090, BG8743, BG9163, DBL1, 
EF3296, EF6796, and EF10197 (Table 1). pspA gene fragments, from 
fifteen strains, were amplified by PCR using two primers provided 
by Connaught Laboratories, Swiftwater, Pennsylvania Primer N192- 

5 9 GGAAGGCCATATGCTCAAAGAGATTGATGAGTCT3 ' and primer C588 - 
5 ' CCAAGGATCCTTAAACCCATTCACCATTGGC3 ' were engineered with Ndel and 
BamHI restriction endonuclease sites, respectively. PCR- 
amplified gene products were digested with BajnHI and Ndel, and 
ligated to linearized pET-9a digested likewise and further 
treated with bacterial alkaline phosphatase United States Bio- 
chemical Corporation, Cleveland, Ohio) to prevent recirculari- 
zation of the cut plasmid. Clones were first established in E. 
coli BL21(DE3) which contained a chromosomal copy of the T7 RNA 
polymerase gene under the control of an inducible lacUVS 
promoter . 
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E* coli DH1 cells were transformed by the method of 
Hanahan (Hanahan, D* J- Mol. Biol. 1983; 166 : 557-580). Stable 
transf onnants were identified by screening on LB-kanamycin 
plates. Plasmid constructs, isolated from each of these strains, 
were electroporated (Electro Cell Manipulator 600, BTX 
Electroporation System, San Diego, California) into E. coli 
BL21(DE3) and their respective strain designations are listed in 
Table 1. The pET-9a vector alone was introduced into E. coli 
BL21(DES) by electroporation to yield strain RCT125 (Table 2)* 
All plasmid constructs and PCR-amplif ied pspA gene fragments were 
evaluated by agarose gel electrophoresis (with 1 kb DNA ladder, 
Gibco BRL, Gaithersburg , Maryland) . Next, Southern analysis was 
performed using LMpspAl, a previously described full-length pspA 
probe (McDaniel. L.S. et al., Microb. Pathog. 1992; 13: 261-269) 
random primed labeled with digoxigenin-ll-dUTP (Genius System, 
Boehringer Mannheim, Indianapolis, Indiana) . Hybridization was 
detected with chemiluminescent sheets according to the 
manufacturer's instructions (Schleicher & Schuell, Keene, New 
Hampshire) . 

Cell fractionation of recombinant £. coli strains . 
Multiple cell fractions from transformed E. coli were evaluated 
for the expression of truncated PspA molecules. Single colonies 
were inoculated into 3 ml LB cultures containing kanamycin and 
grown overnight at 37°C Next, an 80 ml LB culture, inoculated 
with 1:100 dilution of the overnight culture, was grown at 37°C 
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to mid-exponential phase (A 600 of ca. 0.5) and a 1 ml sample was 
harvested and resuspended (uninduced cells) prior to induction 
with isopropylthiogalactoside (IPTG, 0.3 mM final concentration). 
Following 1, 2, and 3 hr of induction, 0.5 ml of cells were 
centrifuges , resuspended, and labeled induced cells. The 
remaining culture was divided into two aliquots, centrifuged 
(4000 x g, 10 min, DuPont Sorvall RC 5B Plus), and the 
supernatant discarded. One pellet was resuspended in 5 ml of 20 
mM Tris-HCl ph 7.4 200 mM NaCI, 1 mM (ethylenedinitrilo) - 
tetraacetic acid disodium salt (EDTA) and frozen at -20 °C 
overnight. Cells were thawed at 65 °C for 30 min, placed on ice, 
and sonicated for vive 10-sec pulses (0.4 relative output, Fisher 
Sonic Dismembrator, Dynatech Laboratories, Inc. Chantilly, 
Virginia). Next, the material was centrifuged (9000 x g, 20 min) 
and the supernatant was designed the crude extract-cytoplasmic 
fraction. The pellet was resuspended in Tris-NaCl-EDTA buffer 
and labeled the insoluble cell well and membrane fraction. The 
other pellet, from the divided induced culture, was resuspended 
in 10 ml of 30 mM Tris-HCl pH 8.0 containing 20% sucrose and 1 mM 
EDTA and incubated at room temperature for 10 min with agitation. 
Cells were then centrifuged, the supernatant removed, and the 
pellet resuspended in 5 mM MgS0 4 (10 ml, 10 min, shaking 4°C 
bath) . This material was centrifuged and the supernatant was 
designated osmotic shock-periplasmic fraction. Cell fractions 
were evaluated by SDS-PAGE and immunoblot analysis. 
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MAbs to PspA . PspA-specif ic monoclonal antibodies 
(MAbs) XiR278 and 1A4 were used as previously described (Crain, 
M.J. et al. # 1990, Infect. Immun. ; 58: 3293-3299). MAb P50-92D9 
was produced by immunization with DBL5 PspA. The PspA- 
specif icity of MAb P50-92D9 was confirmed by Western Analysis by 
its reactivity with native PspAs from S. pneumoniae DBL5, BG9739, 
EF5668, and L81095 and its failure to recognize the PspA-control 
strain WG44.1. 

SDS-PAGE and immunoblot analysis . £. coli cell 
fractions containing recombinant PspA proteins and biotinylated 
molecular weight markers (low range , Bio-Rad, Richmond, 
California) were separated by sodium dodecyl sulfate- 
polyacrylamide (10%; Bethesda Research Laboratories, 
Gaithersburg, Maryland) gel electrophoresis (SDS-PAGE) by the 
method of Laemmli (Laemmli, U.K. Nature 1970; 227 : 680-685). 
Samples were first boiled for 5 min in sample buffer containing 
60 mM Tris pH 6.8, 1% 2-B-mercaptoethanol (Sigma, St. Louis, 
Missouri), 1% SDS, 10% glycerol, and 0.01% bromophenol blue. 
Gels were subsequently transferred (1 hr, 100 volts) to 
nitrocellulose (0.45 mM pores, Millipore, Bedford, Massachusetts) 
as per the method of Towbin et al. Blots were blocked with 3% 
casein, 0.05% Tween 20 in 10 mM Tris, 0.1 M NaCl, pH 7.4 for 30 
min prior to incubating with PspA-specif ic monoclonal antibodies 
diluted in PBST for 1 hr at 25 °C. Next, the blot was washed 3 
times with PBST before incubating with alkaline phosphatase- 
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labeled goat anti-mouse immunoglobulin (Southern Biotechnology 
Associates, Inc., Birmingham, Alabama) for 1 hr at 25°C. Washes 
were performed as before and blots was developed with 0,5 mg/ml 
5-bromo-4-chloro-3-indolyl phosphate and 0-01% nitro blue 
tetrazolium (Sigma) first dissolved in 150 fil of dimethyl 
sulfoxide and then diluted in 1*5 M Tris-HCl pH 8.8. Dot blots 
were analyzed similarly, Lysate samples (2 pi) were spotted on 
nitrocellulose filters (Millipore) , allowed to dry, blocked, and 
detected as just described. 

Preparation of cell lvsates containing recombinant PspA 
proteins . Transformed E. coli strains RCT105, RCT113, RCT117, 
and RCT125 (Table 2) were grown in mid-exponential phase in 
minimal E medium before IPTG induction (2 mM final concentration, 
2 hours, 37 °C) . Cultures were harvested by centrifugation (10 
min at 9000 x g) , resuspended in Tris-acetate pH 6.9, and frozen 
at -80°C overnight. Samples were thawed at 65°C for 30 min, 
cooled on ice, and sonicated. Next the samples were treated with 
0.2 mM AEBSF (Calbiochem, La Jolla, California) at 37°C for 30 
min and finally centrifuged to remove cell wall and membrane 
components. Dot blot analysis was performed using PspA-specif ic 
MAbs to validate the presence of recombinant, truncated PspA 
molecules in the lysates prior to their use as immunogens in 
mice- Unused lysate material was stored at -20°C until 
subsequent immunizations were performed. 
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Mouse immunization and challenge , CBA/CAHN-XID/J mice 
(Jackson Laboratories, Bar Harbor, Maine), 6-12 weeks old, were 
employed for protection studies. These mice carry a X-linked 
immunodeficiency that prevents them from generating antibody to 
polysaccharide components, thus making them extremely susceptible 
to pneumococcal infection. Animals were immunized subcutaneously 
with cell lysates from E coli recombinant strains RCT105, RCT113, 
RCT117, and RCT125 (Table 2) in complete Freund's adjuvant for 
primary immunizations. Secondary injections were administered in 
incomplete adjuvant and subsequent boosts in dH 2 0. Immunized and 
nonimiuunized mice (groups of 2 to 5 animals) were challenged with 
S. pneumoniae strains A66.3, BG7322, DBL6A, WU2, DBL5, BG9739, 
and L81905 intravenously (tail vein) to induce pneumococcal 
sepsis • Infected animals were monitored for 21 days and mice 
that survived the 3-week evaluation period were designated 
protected against death and scored as surviving 22 days for 
statistical analysis. Protection that resulted in extension of 
life was calculated as a comparison between mean number of days 
to death for immunized versus pooled control mice (non immunized 
and RCT125 sham- immunized; total of 6-7 animals) . 

Determination of PspA serum levels . Mice were bled 
retro-orbitally following the secondary boost and again prior to 
challenge. Representative mouse titers were evaluated by enzyme- 
linked immunorsorbent assay (ELISA) using native, parental PspAs 
isolated from pneumococcal strains DBL5 , BG9739, and L81905. 
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PspAs were immobilized on microtiter plates by incubating in 0.5 
NaHC0 3 , 0.5 M Na 2 C0 3 pH9.5 at 4°C overnight. Alkaline 
phosphatase-labeled goat anti-mouse immunoglobulin (Southern 
Biotechnology Associates, Inc.) was used to detect mouse serum 
antibodies. Color development was with p-nitrophenyl phosphate 
(Sigma, 1 mg/ml) in 0.5 m MgCL 2 pH 9.8 with 10% diethanolamine 
and absorbance was read at 405 nm after a 30 min incubation. 
Reciprocal titers were calculated as the last dilution of 
antibody that registered an optical density value of 0.1. Sera 
from individual mice within a particular immunogen group were 
evaluated separately and then the respective titers from four 
mice per group were combined to obtain titer range (Table 3). 

statistics. The one-tailed Fisher exact and two sample 
rank tests were used to evaluate protection against death and 
extension of life in the mouse model. 

Cloning of tru ncated pspA genes . Using primers N192 
and C588, truncated pspA genes from fifteen diverse pneumococcal 
strains representing eight different capsular types (Table l) 
were amplified by PCR. Even though variability exists in pspA 
genes from different strain, this result demonstrates that 
sufficient conservation exists between variant pspA genes to 
allow sequence amplification in all strains examined to date. 
Successful pspA PCR-amplif ication extended to all capsule types 
evaluated. 
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Fourteen of the amplified pspA genes were cloned and 

three clones containing truncated PspA molecules from 

pneumococcal strains DBL5, BG9739, and L81905 were further 

studies (Table 2) . To verify the constructions, plasmids from 

recombinant E. coli strains (RCT105, RCT113, RCT117, and RCT125 

(Table 2) were isolated, digested with Ndel and BAMHI restriction 

endonuc leases, and electrophoresed in 1% agarose side-by^side 

with the PCR products used in their respective constructions 

(Figure 1A) . The digestion reaction was complete for pRCT105, 

while pRCT113 and pRCT117 digestions were incomplete (lanes 5 and 

7, respectively). This gel was denatured and DNA transferred to 

nylon for Southern analysis. Figure IB depicts the corresponding 

Southern blot probed with full-length RxlpspA DNA. Lane 1 

contains pRCT125, digested vector alone, which does not react 

with the pneumococcal DNA-specific probe, as expected- The pspA- 

specific probe hybridized with the PCT products and the digested 

♦ 

plasmid inserts (see arrow. Figure IB) as well as the partially 
undigested pRCTH3 and pRCTH7 (lane 5 and 7), confirming 
successful cloning of DBL5, BG9739, and L81905 pspA DNA. 
Constructions were similarly confirmed with the eleven additional 
recombinant strains containing truncated pspA genes from S. 
pneumoniae strains of different capsular and PspA types. 

Expression of recombinant PspA in E. coli B121(De3) . 
Transformed E. coli strains RCT105, RCT113, RCT117, and RCT125 
were cultured to mid-exponential phase prior to the addition of 
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IPTG to induce expression of the cloned, truncated pspA gene in 
each strain. A cell fractionation experiment was performed to 
identify the location of recombinant PspA proteins in transformed 
E. coli strains. Samples representing uninduced cells, included 
cells (1 hr, 2 hr, and 3 hr time intervals), the periplasmic 
fraction, the cytoplasmic fraction, and insoluble cell 
wall /membrane material were resolved by SDS-PAGE. Proteins were 
then transformed to nitrocellulose and Western analysis was 
performed using monoclonal antibodies specific for PspA epitopes. 

Figure 2 reveals that both the cytoplasmic (lane 8) and 
the insoluble matter fractions (lane 9), from recombinant strain 
RCT 105, contain a protein of approximately 53. 7 'kDa that is 
recognized by MAb XiR278 that is not seen in the uninduced cell 
sample (lane 3) . This protein increases in quantity in direct 
correlation with the length of IPTG induction (lanes 4-6; l hr, 2 
hr, and 3 hr respectively) . No truncated RCT105 PspA was found 
in the periplasmics fraction (lane 7), which was expected since 
the pET-9a vector lacks a signal sequence that would be necessary 
for directing proteins to the periplasm. The observed molecular 
weight (ca. 53.5 kDa) is larger than the predicted molecular 
weight for the 1.2 kb DBLSpspA gene product (43.6 kDa; Figure 1A, 
lane 4). Like full-length Rxl PspA, the observed and predicted 
molecular weights for truncated PspAs do not agree precisely. In 
addition, immunoblot analysis was performed for recombinant E. 
coli strains RCT113, and RCT117 (using MAbs 1A4 and P50-92D, 
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respectively) and similar results were obtained, while no cell 
fractions from control strain RCT125 were recognized by MAb 
XiR278. 

Evaluating the prote ctive capacity o f recombinant 
truncate d PspAs . The truncated PspA proteins from strains 
RCT113, RCT117, and RCT105 were expressed and analyzed for their 
ability to generate cross-protection against a battery of seven 
S. pneumoniae strains. Control mice (non- immunized and RCT125 
sham- immunized) and recombinant PspA-immunized mice were 
challenged with mouse-virulent strains A66.3, BG7322, DBL6A, WU2, 
DBL5, BG9739, and L81905. Table 3 presents the day of death for 
each infected mouse. 

Immunization with truncated PspA from RCT113, RCT117, 
and RCT105 conferred protection against death for all mice 
challenged with capsular type 3 strains (A66.3 and WU2 (Table 3). 
The three truncated PspAs also provided significant protection 
against death with DBL6A, and BG7322 pneumococci (capsular types 
6A and 6B, respectively) . In addition, immunization with 
recombinant RCT113 PspA extended days to death in mice challenged 
with strains DBL5 , BG9739, and L81905, while RCT117 PspA 
prolonged the lives of mice inoculated with BG9739 pneumococci 
(Table 3) . Truncated BG9739 PspA elicited protection against all 
challenge strains (100%) evaluated in this study, while 
recombinant L81905 and DBL5 truncated PspAs conferred protection 
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against death with 71% and 57% of s. pneumoniae challenge 

strains , respectively . 

Anti-PspA antibody titers elicited by the three 

immunogens vary over approximately a 10-fold range (Table 3). 
The lowest antibody levels were elicited by RCT105 and this 
truncated PspA also elicited protection against the fewest number 
of challenge strains . RCT113 and RCT117 elicited three and nine 
time as much anti-PspA antibody, respectively. As expected, no 
antibody to PspA was detected in non immunized mice nor was 
specif ic-PspA antibody measured in mice immunized with the 
vector-only control strain (RCT125) . 

In summary, immunization with RCT113 and RCT117 PspAs 
protected mice against fatal challenge with capsular type 3 and 
6A strains and extended life for mice inoculated with type 4, 5, 
and 6B pneumococci. RCT105 PspA immunization protected against 
fatal infection with capsular type 3 and 6B strains and prolonged 
time to death for type 6A 5. pneumoniae but offered not 
protection against type 4 and 5 strains. These data demonstrate 
that truncated PspAs from capsular type 4 and 5 pneumococci 
collectively protect mice and ergo other hosts, such as humans, 
against or delay death caused by each of the seven challenge 
strains. In general, however, more complete protection was 
observed against strains of capsular type 3, 6A, and 6B than 
against type 4 and 5 S. pneumoniae. 
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PspA has been shown to be a protection-eliciting 
molecule of S. pneumoniae . Immunization with PspA has also been 
shown to be cross-protective, although eliciting more complete 
protection against certain strains than others. Thus, it is 
possible that a broadly protective PspA vaccine might need to 
contain PspAs of more than one pneumococcal strain. The distal 
third of the alpha-helical region of PspA has been identified as 
a major protective region of PspA. Moreover, this region is 
presented in a very antigenic form when expressed with the intact 
C-terminal half of the molecule. In this Example, the ability to 
use truncated PspA proteins homologous to the region of Rxl PspA 
extending from amino acid residue 192 to the C-terminus at 
residue 588 is demonstrated. 

The C-terminal two-thirds of PspA was cloned from 
fourteen strains by PCR amplification of a gene fragment of the 
appropriate size (1.2 kb) which hybridized with full-length Rxl 
pspA. Successful PCR amplification extended to all capsule types 
analyzed. Thus, the c-terminal two-third of PspA may be 
amplified from many, if not all, pneumococcal capsule types with 
Rxl pspA-specif ic primers. This technique is thus applicable to 
the development of antigenic immunological or vaccine 
compositions containing multiple PspA or fragments thereof. 

Of these clones, three truncated PspA proteins were 
expressed and evaluated in mouse immunization studies to 
determine their ability to cross-protect against challenge with a 
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variety of pneumococcal capsular types* All three recombinant 
PspAs elicited antibody reactive with their respective donor PspA 
and all three elicited protection against pneumococcal infection. 
Of the two truncated PspA proteins that elicited the highest 
antibody responses, 100% and 71% of the challenge strains were 
protected. RCT105 PspA, which elicited the lowest titers of 
PspA-specif ic antibody, yielded protection against 57% of S. 
pneumoniae strains evaluated. With all truncated PspAs, 
significant levels of protection were observed in four of the 
seven challenge strains. In fact, in all instances except for on 
(RCT1 05- immunized mice challenged with strain BG9739) the trend 
was for truncated PspA-immunization to elicit protection against 
pneumococcal challenge. These results demonstrate that truncated 
Rxl PspA (amino acids 192-588) cross-protects mice against fatal 

pneumoniae WU2 challenge. More importantly, these data show 
that the homologous regions of diverse PspAs demonstrate 
comparable cross-protective abilities. 

Strains of capsular type 4 and 5 were more difficult to 
protect against than were type 3, 6A and 6B pneumococcal strains. 
Serological differences in PspAs might affect cross-protection in 
some cases. Yet the difficulty in protecting against the type 4 
and 5 strains used herein could not be explained on this basis, 
since the truncated PspA immunogens were cloned from the same 
three type 4 and 5 strains used for challenge. Both PspAs from 
the type 4 strains delayed death caused by one or both type 4 
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challenge strains but neither could prevent death caused by 
either type 4 pneumococcal strain. Moreover, the truncated PspA 
from the type 5 strain DBL5 elicited protection against death or 
delayed death with strains of capsular types 3, 6A and 6B but 
failed to protect against infection with its donor strain or 
either type 4 challenge strain. 

There may be several reasons why the truncated PspAs 
from capsular type 4 and type 5 strains failed to protect against 
death even with their homologous donor S. pneumoniae strains. 
One possibility is that the type 4 and 5 strains chosen for study 
are especially virulent in the XID mouse model. XID mice fail to 
make antibodies to polysaccharides and are therefore extremely 
susceptible to pneumococcal infection with less than 100 CFU of 
most strains, including those of capsular type 3, 4, 5, 6A, and 
6B. The increased mouse virulence of types 4 and 5 is apparent 
from the fact that in immunologically normal mice these strains 
have lower LD 50 s and/ or are more consistently fatal than strains 
of capsular types 3, 6A, or 6B- 

Another possibility is that epitopes critical to 
protection-eliciting capacity with capsular type 4 and 5 strains 
are not present in the C-terminal two-thirds of PspA (amino acids 
192-588) , the truncated fragments used for immunization. The 
critical epitopes for these strains may be located in the N- 
terminal two thirds of the alpha-helical region of their PspA 
molecules. Finally, it is also possible that PspA may be less 
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exposed on some S. pneumoniae strains than others. Strain Rxl 
PspA amino acid sequence does not contain the cell wall 
attachment motif LPXTGX described by Schneewind et al. found in 
many gram-positive bacteria. Rather, PspA has a novel anchoring 
mechanism that is mediated by choline interactions between 
pneumococcal membrane-associated lipoteichoic acid and the repeat 
region in the C-terminus of the molecule. Electron micrographic 
examination has confirmed the localization of PspA on the 
pneumococcal surface and PspA-specif ic MAb data supports the 
accessibility of surface-exposed PspA. However, it is not known 
whether S. pneumoniae strains differ substantially in the degree 
to which different PspA regions are exposed to the surrounding 
environment. Nor is it known if the quantity of PspA expressed 
on the bacterial cell surface differs widely between strains. 
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Table 1. gspA recombinant strains categorized by pneumococcal 
capsular type. 



Capsular 
Type 



Parent 
Strains 



Respective 
Recombinant Strains 



3 


WU2, EF10197 


RCT111, RCT137 


4 


BG9739, EF5668 


RCT113, RCT115 




L81905, EF3296 


RCT117, RCT133 


5 


DBL5 


RCT105 


6A 


DBL6A, EF6796 


RCT109, RCT135 


6B 


BG9163, DBL1 


RCT129, RCT131 


14 


TJ0893 


none* 


19 


BG8090 


RCT121 


23 


0922134, BG8743 


RCT119, RCT123 



Truncated pgpA amplified recently, not yet cloned 
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Table 2. Description of recombinant strains used in evaluating the 
protection-eliciting capacity of truncated PspAs in mice. 

Recombinant Capsule Type 

Strain Description of Parent PspA 



RCT 105 BL21(DE3) E^coli with pET-9a:DBL5 5 

RCT 113 BL21(DE3) E. coli with pET-9a:BG9739 4 

RCT 117 BL21(DE3) E. coli with P ET-9a:L81905 4 

RCT 125 BL21(DE3) E. coli with pET-9a (vector only) 
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EXAMPLE 2 - Localization of protection-eliciting epitopes 

and PspA of S. pneumoniae 

This Example, the ability of PspA epitopes on two PspA 
fragments (amino acids 192-588 and 192-299) to elicit cross- 
protection against a panel of diverse pneumococci is 
demonstrated. Also, this Example identifies regions homologous 
to amino acids 192-299 of Rxl in 15 other diverse pneumococcal 
strains. The DNA encoding these regions was then amplified and 
cloned. The recombinant PspA fragments expressed were evaluated 
for their ability to elicit cross-protection against a panel of 
virulent pneumococci. 

Bacterial stra ins and media conditions , s. pneumoniae 
strains were grown in Todd Hewitt broth with 0.5% yeast extract 
(THY) (both from Difco Laboratories, Detroit, Michigan) at 37 °C 
or on blood agar plates containing 3% sheep blood at 37 °C under 
reduced oxygen tension. E. coli strains were grown in Luria- 
Bertani medium or minimal E medium. Bacteria were stored at - 
80 °C in growth medium supplemented with 10% glycerol. E. coli 
were transformed by the methods of Hanahan (Hanahan, D. J. Mol. 
Biol. 1983; 166: 557). Ampicillin (Ap) was used at a 
concentration of 100 Mg/ml for E. Coli. 

Construction of pIN-III-ompA3 and pMAL-based E. Coli 
strains. Recombinant plasmids pBClOO and pBAR416 that express 
and secrete pspA fragments from £. Coli were constructed with 
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pIN-III-ompA3 as previously described (McDaniel, L.S. et al., 
Microb. Pathog. 1994; 17: 323). 

The pMAL-p2 vector (New England Biolabs, Protein Fusion 
& Purification System, catalog #800) was used for cloning pspA 
gene fragments to amino acids 192-299 from strain Rxl and from 7 
other S. pneumoniae strains: R36A, D39, A66, BF9739, DBL5, 
DBL6A, and LM100. Amplification of the pspA gene fragments was 
done by the polymerase chain reaction (PCR) as described 
previously (McDaniel, L.S. et al., Microb. Pathog. 1994; 17: 323) 
using primers 5 ' CCGGATCCGCTCAAAGAGATTGATGAGTCTG3 ' [LSM4] and 
5 ' CTGAGTCGACTGAGTTTCTGGAGCTGGAGC3 ' [LMS6J made with BamHI and 
Sail restriction endonuclease sites, respectively. Primers were 
based on the sequence of Rxl PspA. PCR products and the pMAL 
vector were digested with BAMHI and Sail, and ligated together. 
Clones were transformed into E. Coli DH5a by the methods of 
Hanahan. Stable transf ormants were selected on LB plates 
containing 100/xg/xnl Ap. These clones were screened on LB plates 
containing 0.1 mM IPTG, 80 jig/ml X-gal and 100 ^iq /ml Ap and 
replica LB plates with 100 Mg/ml Ap according to the 
manufacturer's instructions. The strain designations for these 
constructs are listed in Table 6. Positive clones were evaluated 
for the correct pspA gene fragment by agarose gel electrophoresis 
following plasmid isolation by the methods of Birnboim and Doly 
(Birnboim, H.C. et al., Nucl. Acids Res. 1979, 7: 1513). 
Southern analysis was done as previously described using a full- 
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length pspA probe (McDaniel, L.S. et al., Microb. Pathog* 1994; 
17 : 323) , randomly primed labeled with digoxigenin~ll-dUTP 
(Genius System, Boehdinger Mannheim, Indianapolis, Indiana) and 
detected by chemi luminescence. 

Expression of recombinant PspA protein fragments . For 
induction of expression of strains BClOO.and BAR416, bacteria 
were grown to an optical density of approximately 0.6 at 660 nm 
at 37°C in minimal media, and IPTG was added to a final 
concentration of 2 raM. The cells were incubated for an 
additional 2 hours at 37°C, harvested, and the periplasmic 
contents released by osmotic shock. For strains BAR36A, BAR39, 
BAR66, BAR5668, BAR9739, BARL5, BAR6A and BAR100, bacteria were 
grown and induced as above except LB media + 10 mM. glucose was 
the culture medium. Proteins from these strains were purified 
over an amylose resin column according to the manufacturer's 
instructions (New England Biolabs, Protein Fusion & Purification 
System, Catalog #800) . Briefly, amylose resin was poured into a 
10 mL column and washed with column buffer. The diluted osmotic 
shock extract was loaded at a flow rate of approximately 1 
mL/minute. The column was then washed again with column buffer 
and the fusion protein eluted off the column with column buffer 
containing 10 mM maltose. Lysates were stored at -20°C until 
further use. 

Characterization of truncated PspA proteins used for 
immunization > The truncated PspA molecules, controls and 
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molecular weight markers (Bio-Rad, Richmond, CA) were 
electrophoresed in a 10% sodium dodecyl (SDS) - polyacrylamide 
gel and electroblotted onto nitrocellulose. Rabbit polyclonal 
anti-PspA serum and rabbit antimaltose binding protein were used 
as the primary antibodies to probe the blots. 

A direct binding ELISA procedure was used to 
quantitatively confirm reactivities observed by immunoblotting. 
For all protein extracts, osmotic shock preparations were diluted 
to a concentration of 3 Mg/ml in phosphate buffered saline (PBS) , 
and 100 /il was added to the wells of Immulon 4 microtitration 
plates (Dynatech Laboratories, Inc., Chantilly, VA) * After 
blocking with 1.5% bovine serum albumin in PBS, unf ractionated 
tissue culture supernates of individual MAbs were titered in 
duplicated by three-fold serial dilution through seven wells and 
developed using an alkaline phosphatase-labeled goat anti-mouse 
immunoglobulin secondary antibody (Southern Biotech Associates, 
Birmingham, AL) and alkalinephosphatase substrate (Sigma, St* 
Louis, MO) . The plates were read at 4 05 nm in a Dynatech plate 
reader after 25 minutes, and the 30% end point was calculated for 
each antibody with each preparation. 

Immunization and Protection Assays , Six to nine week 
old CBA/CAHN-XID/J (CBA/N) mice were obtained from the Jackson 
Laboratory, Bar Harbor, Maine. CBA/N mice carry an X-linked 
immunodeficiency trait, which renders them relatively unable to 
respond to polysaccharide antigens, but they do respond with 
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normal levels of antibodies against protein antigens. Because of 

the absence of antibodies reactive with the phosphocholine 

determinant of C-polysaccharide in their serum, the mice are 

highly susceptible to pneumococcal infection. Mice immunized 

with the BC100 fragment were injected inguinally with antigen 

emulsified in CFA, giving an approximate dose of 3 /xg of protein 

per mouse. Fourteen days later the mice were boosted 

intraperitoneally with 3 /zg of antigen diluted in Ringer's 

lactate without adjuvant. Control mice were immunized following 

the same protocol with diluent and adjuvant, but no antigen. 

Mice immunized with the BAR416 fragment were injected with 0.2 ml 

at two sites in the sublinguinal area with antigen emulsified in 

CFA. The mice were boosted inguinally fourteen days later with 

antigen emulsified in I FA and were boosted a second time fourteen 

days later intraperioneally with 0.2 ml of antigen diluted in 

Ringer's lactate without adjuvant. 
* 

Mice that were immunized with the homologues of Rxl 
BAR416 were immunized as described above. The control animals 
followed the same immunization protocol but received maltose 
binding protein (MBP) diluted 1:1 in CFA for their immunization 
and were also boosted with MBP. 

Serum analysis. Mice were retro-orbitally bled with a 
75 ^1 heparinized microhematocrit capillary tube (Fisher 
Scientific) before the first immunization and then once 
approximately 2 hours before challenge with virulent pneumococci. 
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The serum was analyzed for the presence of antibodies to PspA by 
an enzyme- linked immunosorbent assay (ELISA) using native full^ 
length R36A PspA as coating antigen as previously described 
(McDaniel, L.S. Microb. Pathog. 1994; T&i 323). 

Intravenous infection of mice . Pneumococcal cultures 
were grown to late log phase in THY. Pneumococci were diluted to 
10 4 CFU based on the optical density at 420 ran into lactated 
Ringer's solution. Seven days following the last boost injection 
for each group, diluted pneumococci were injected intravenously 
(tail vein) in a volume of 0.2 ml and plated on blood agar plates 
to confirm the numbers of CFU per milliliter. The final 
challenge dose was approximately 50-100 times the LD 50 of each 
pneumococcal strain listed in Tables 4-6. The survival of the 
mice was followed for 21 days. Animals remaining alive after 21 
days were considered to have survived the challenge. 

Statistical analysis . Statistical significance of 
differences in days to death was calculated with the Wilcoxon 
two-sample rank test. Statistical significance of survival 
versus death was made using the Fisher exact test. In each case, 
groups of mice immunized with PspA containing preparations were 
compared to unimmunized controls, or controls immunized with 
preparations lacking PspA. One-tailed, rather than two-tailed, 
calculations were used since immunization with PspA or fragments 
of PspA has never been observed to cause a statistically 
significant decrease in resistance to infection. 
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Cloning into pMAL vector . Using primers based on the 
sequence of Rxl PspA, LSM4 and LSM6, pspA gene fragments were 
amplified by PCR from fifteen out of fifteen pneumococcal strains 
examined. Seven of the eleven gene fragments were cloned into 
pMAL-p2 and transformed into E. coli (Table 6) . The correct 
insert for each new clone was verified by agarose gel 
electrophoresis and Southern hybridization analysis. Plasmids 
from recombinant E. coli strains BAR36A, BAR39 , BAR66, BAR9739, 
BARL5, BAR6A and BAR100 were isolated, digested with BamUl and 
Sail restriction endonucleases and electrophoresed on a 0.7% TBE 
agarose gel. The gel was then denatured and the DNA transferred 
to a nylon membrane for southern hybridization. The blot was 
probed with full-length Rxl pspA DNA at high stringency 
conditions. The cloning of R36A, D39 r A66, BG9739, DBL5, DBL6A 
and LM100 pspA DNA into pMal-p2 was confirmed by the recognition 
of all BamHI and Sail digested DNA inserts by the Rxl probe. 

Expression and conformation of truncated recombinant 
proteins . The transformed E. coli strains BAR3 6 A , BAR39, BAR66, 
BAR9739 r BARL5 , BAR6A and BAR100 were grown in LB media 
supplemented with 10 mM glucose and induced with 2 mM IPTG for 
expression of the truncated PspA protein fused with maltose 
binding protein. Transformed E. coli strains BC100 and BAR416, 
which express PspA fragments fused to the OmpA leader sequence in 
the pIN-III-ompA3 vector, were grown in minimal medium and 
induced with 2 mM IPTG for expression. Both vectors, pIN-III- 
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ompA3 and pMal-p2, are vectors in which fusion proteins are 
exported to the periplasmic space. Therefore, an osmotic shock 
extract from the pMal-p2 containing bacteria was then run over an 
amylose column for purification and resolved by SDS-PAGE western 
blotting. The western blot of the protein extracts from BAR36A, 
BAR39, BAR66, BAR9739 r BARL5 , BAR6A and BAR100 were recognized by 
a rabbit polyclonal antibody made to strain BC100 PspA. The 
apparent M r of full-length PspA from WU2 is 91.5 kD. The M r of 
maltose binding protein is 42 kD and the expected M r for the PspA 
portion of the fusion is 12 kD. All extracts exhibited molecular 
weights that ranged from 54 to 80 kD. This range of molecular 
weights can be attributed to the variability of pspA among 
different pneumococcal strains. An ELISA, with plates coated 
with the various cloned fragments quantitatively confirmed the 
reactivities that were observed in the western blots with all 
protein extracts. 

Protection and cross-protection against fatal 
pneumococcal infection elicited bv cloned PspA fragments . CBA/N 
mice were immunized with the truncated PspA fragment encoded by 
pBClOO, which is composed of amino acids 192 to 588 of Rxl PspA, 
and challenged with 13 different S. pneumoniae strains 
representing 7 different capsular types (Table 4) ♦ With all 13 
strains, the immunization resulted in protection from death or an 
extended time to death. With 10 of the strains the difference 
was statistically significant. With strains of capsular types 3, 
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6A, and 6B, all immunized mice were protected against death ♦ 
Although there were fewer survivors in the case of capsular types 
2, 4, and 5, the immunization with BC100 resulted in significant 
increases in times to death. 

The BC100 immunization studies made it clear that 
epitopes C-terminal to residue 192 could elicit cross-protection* 
The BAR416 fragment, which includes amino acids 192-299, could 
elicit protection from fatal infection with a single challenge 
strain WU2. This Example shows the ability of BAR416 
immunization to protect against the 6 strains that had been best 
protected against by immunization with BC100. Immunization with 
the BAR416 construct resulted in increased time to death for all 
6 challenge strains examined (Table 5) . BAR416 provided 
significant protection against death with WU2, A66, BG7322 and 
EF6796 pneumococci (capsular types 3, 3, 6B and 6A respectively). 
It also prolonged the lives of mice challenged with ATCC63 03 and 
DBL6A pneumococci (capsular types 3 and 6A respectively) . Serum 
from mice immunized with the BAR416 fragment yielded a geometric 
mean reciprocal anti-PspA ELISA titer to full-length Rxl PspA of 
750. Mice immunized with BC100 had geometric mean reciprocal 
titers of close to 2000, while non-immunized mice had anti-PspA 
titers of <10. 

The above data indicates that the BAR416 fragment from 
Rxl elicits adequate cross-reactive immunity to protect against 
diverse pneumococci and suggests that this region must be 
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serologically conserved among PspAs. This hypothesis was 
confirmed by immunized with recombinant BAR416 homologous regions 
from the 7 different clones and then challenging with strain WU2 
(Table 6) . All 7 immunogens elicited significant protection. 
PspA fragments from capsular types 2 and 22 and the rough R36A 
strain elicited complete protection against death with all 
challenged mice. All of the other immunogens were able to extend 
the time to death of all the mice with the median days to death 
being 21 days or >21 days. Serum from mice immunized with the 
BAR416 homologous fragments had anti-PspA reciprocal titers that 
ranged from 260 to 75,800 with an average of 5700 while control 
animals immunized with only maltose binding protein had anti-PspA 
reciprocal titers of <10. 

Antibody reactivities . All of the above immunization 
studies attest to the cross-reactivity of epitopes encoded by 
amino acids from position 192-299. This region includes the c- 
terminal third of the a-helical region and the first amino acids 
of the proline rich region. Other evidence that epitopes within 
this region are cross-reactive among different PspAs comes form 
the cross-reactivity of a panel of nine MAbs all of which were 
made by immunization with Rxl PspA. The epitopes of four of the 
antibodies in the panel reacted with epitopes mapping between 
amino acids 192-260. The epitopes of the other five MAbs in the 
panel map between amino acids 1 and 115 (McDaniel, L.S., et al., 
Microb. Pathog. 1994; 17: 323). Each of these 9 MAbs were tested 
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for its ability to react with 8 different PspAs in addition to 
Rxl* The 5 MAbs whose epitopes were located within the first 115 
amino acids, reacted on average with only l other PspA. Three of 
the 5 in fact, did not react with any of the other 8 PspAs. In 
contrast the MAbs whose epitopes map between 192 and 260 amino 
acids each cross-reacted with an average of 4 of the 8 non-Rxl 
PspAs, and all of them reacted with at least two non-Rxl PspAs . 
Thus, based on this limited section of individual epitopes, it 
would appear that epitopes in the region from 192-260 amino acids 
are generally much more cross-reactive than epitopes in the 
region from 1-115 amino acids. 

The BC100 fragment of Rxl PspaA can elicit protection 
against the encapsulated type 3 strain WU2 . Although the PspAs 
of the two strains can be distinguished serologically they are 
also cross-reactive (Crain, M.J W et al. # Infect. Immun. 1990; 
58: 3293). The earlier finding made it clear that epitopes 
cross-protective between Rxl and WU2 PspAs exist* The importance 
of cross-reactions in the region C-terminal to residue 192 is 
demonstrated in this Example where 13 mouse virulent challenge 
strains have been used to elicit detectable protection against 
all of them. The first indication that epitopes C-terminal to 
position 192 might be able to elicit cross-protection came from 
our earlier study where we showed the MAbs Xi64, XiR278, XiR1323, 
and XiR1325, whose epitopes mapped between amino acids 192 and 
260 of strain Rxl PspA, could protect against infection with WU2. 
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Moreover, immunization with PspA fragments from 192-588 and 192- 
299 were able to elicit protection against infection against WU2. 
This Example shows that the BC100 Rxl fragment (192-588) elicits 
significant protection against each of 13 different mouse 
virulent pneumococci, thereby firmly establishing the ability of 
epitopes C-terminal to position 192 to elicit a protective 
response. The observation that a fusion protein containing amino 
acids 192-299 fused C-terminal to maltose binding protein could 
also elicit cross-protection, permits the conclusion that 
epitopes in this 107 amino acid region of PspA are sufficient to 
elicit significant cross-protection against a number of different 
strains. 

Evidence that a comparable region of other PspAs is 
also able to elicit cross-protection cam from the studies where 
sequences homologous to the 192-299 region of Rxl PspA were made 
from 5 other PspAs. All 5 of these fragments elicited 
significant protection against challenge with strain WU2. These 
data provide some suggestion for serologic differences in cross- 
protection elicited by the 192-299 region. 

Based on present evidence, without wishing to be bound 
by any one particular theory, it is submitted that the PspAs in 
strains D39, Rxl and R36A are identical. All of the 9 mice 
immunized with the 192-299 fragments from R36A and D39 survived 
challenge with WU2- Only LM100, one of the non-R36A/D39 PspAs, 
protected as high a percentage of mice from WU2. The difference 
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in survival elicited by the R36A/D39 PspAs and the non-Rxl 
related PspAs was statistically significant. 

The data does indicate however, that all of the 
differences in protection against different strains are not due 
to differences in serologic cross-reactivity. BC100, which is 
made from Rxl, protected against death in 100% of the mice 
challenged with 7 different strains of S. pneumonia, but only 
delayed death with strain D39, which is thought to have the same 
PspA as strain Rxl. Thus # some of the differences in cross- 
protection are undoubtedly related to factors other than PspA 
cross-reactivity. Whether such factors are related to 
differences in virulence of the different strains in the 
hypersuceptible Xid mouse, or differences in requirements for 
epitopes N-terminal to amino acid 192, or difference in the role 
of PspA in different strains is not yet known. 

These results suggest that a vaccine containing only 
the recombinant PspA fragments homologous with Rxl amino acids 
192-299 is effective against pneumococcal infection. Moreover, 
the results demonstrate that utility of PspA a. a. 192-299, a. a. 
192-260 and DNA coding therefor, e.g. primers N192 or 588 
(variants of LSM4 and LSM2) as useful for detecting the presence 
of pneumococciae by detecting presence of that which binds to the 
amino acid or to the DNA, or which is amplified by the DNA, e.g., 
by using that DNA as a hybridization probe, or as a PCR primer, 
or by using the amino acids in antibody-binding kits, assays or 
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tests; and, the results demonstrate that a. a. 192-299 and a. a. 
192-260 can be used to elicit antibodies for use in antibody- 
binding kits assays or tests* 
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Tatil$ 'Protection of mice by immunization with BC100 from Rxl FspA 



Challenge Capsule FspA 


BClOOImmunogen 


Controls 


p 


Strain* 


type 


type 


£ at iff* 

» alive 

/Sdead 


Survival 


JsU 1 UOT9 

alive 


« alive 
/ 1 dead 


% 

w 9 

Survival 


Median davs 
altve 


value* 


D39 


2 


25 


0/5 


0% 


5 


0/3 


0% 


2 


0.02 


WU2 


3 


1 


4/0 


100% 


>21 


0/3 


0% 


3 


aoo2 


ATCC6303 


3 


7 


5/0 


100% 


>21 


0/5 


0% 


7 


aoo4 


A66 


3 


13 


4/0 


100% 


>21 


0/3 


0% 


I 


om 


EF10197 


3 


18 


5/0 


100% 


>21 


0/3 


0% 


2 


0.02 




4 


12 


1/3 


25% 


9 


0/3 


0% 


4 


N.S. 


EF3296 


4 


20 


1/3 


25% 


5 


0/3 


0% 


3 


N.S. 


L81905 


4 


23 


1/4 


20% 


4 


0/6 


0% 


2 


0.02 


BG9739 


4 


26 


0/4 


0% 


6.5 


0/3 


0% 


2 


N.S. 


DBL5 


5 


33 


0/5 


0% 


5 


0/3 


0% 


2 


0.02 


BC7322 


6 


24 


4/0 


100% 


>21 


1/2 


333% 


6 


0i)3 


EF6796 


6A 


1 


4/0 


100% 


>21 


0/3 


0% 


1 

7 


0,03 


DBL6A 


6A • 


19 


5/0 


100% 


>21 


0/3 


0% 


0.03 



* Mice were challenged with approximately 10 3 CFU/xnL of each strain 

5 P values were based on comparison of days alive by a one-tailed Wilcoxon 2 sample-rank 
test 



ladfifprotection of mice by immunization with BAR416 from Rxl PspA 



Challenge Capsule 


PspA 


BAR416Inununogen 


Controls 


P 


Strain 


type 


type 


% alive 
/*dead 


% 

Survival 


Median days 
alive 


? alive 
/*dead 


% 

Survival 


Median days 
alive 


Value 5 


VVU2 


3 


1 


4/1 


80% 


>21 


0/3 


0% 


1 


0.002 


ATCC6303 


3 


. 7 


2/3 


40% 


13 * 


1/4 


20% 


4 


0.048 


A66 


3 


13 


5/0 


100% 


>21 


0/5 


0% 


2 


0.004 


BC7322 


6 


24 


3/2 


60% 


>21 


0/4 


0% 


7 


0.02 


EF6796 


6A 


1 


3/2 " 


60% 


>21 


0/5 


0% 


» 
a 


0.004 


T>BT.6A 


6A 


19 


0/5 


0% 


7 


0/5 


0% 


2 


0.008 



Note, nice were challenged with about 10 3 CFU of each strain 

S P values were based on comparison of days alive by a one-tailed Wilcoxon 2 sample-rank 
test 
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Table g Protection of mice against S. pneumoniae WU2 by immunization with 
BAR416 Analogs of 7 PspAs 



Immunogen 


Parent 
Strain 


Capsule 
type 


PspA 
type 


S alive 
/ total 2 


% 

Survival 


Median days 
alive 


P value* 
vs. MBP 


BAR36A 


R36A 




25 


4/4 


100% 


>21 


0.002 


BAR39 


D39 


2 


25 


5/5 


100% 


>21 


0.0008 


BAR66 


A66 


3 


13 


7/8 


88% 


>21 




BAR9739 


BG9739 


4 


26 


5/8 


63% 


>21 




BARL5 


DBL5 


5 


33 


4/8 


50% 


21 


0.03 


BAR6A 


DBL6A 


6A 


19 


3/5 


60% 


>21 


0.05 


BAR100 


LM100 


22 


ND 


5/5 


100% 


>21 


0.0008 


MBP 


0/8 


0% 


2 





* P values were based on comparison of days alive by a one-tailed Wilcoxon 2 sample-rank 
test 



Note, the PspA fragments used for immunization were doned from products amplified 
with primers LSM4 and L5M6. In addition to the strains listed above, PCR reactions with 
LSM4 and LSM6 amplified products of the appropriate size from strains BG9163, WU2, 
L81905, EF6796, EF5668, BG7376, BG7322, and BG5-8A. 



Iahlfi^* Reactivity of MAbs with PspAs of Different Pneumococci 



Donor of test PspA 



MAb mapping to 1-11 5 amino acids MAb mapping to 192-260 amino acids 



Strain Capsule PspA 
Type Type 



Xil26 XiR1224 XiR1526 XiR35 XiR16 
IgC2b IgM IgG2b IgC2a IgG2a 



XiR1323 Xi64 XiR278 XiR1325 
IgM IgA IgCl IgC2a 



Rxl 


rough 


25 


ATCC101813 


3 


3 


EF10197 


^ 

.? 


18 


BC9739 


4 


26 

* 


L81905 


4 


23 


BC-5-8A 


6A 


0 


BG9163 


68 


21 


LM100 


22 


M.D. 


WU2 


3 


1 



++ 



++ 



Note, immunoblot analysis was carried out with the nine MAbs from this study against a 
panel of nine different pneumococcal strains. Rxl served as a positive control. The results 
are presented as ++ (strong reaction), + (weak, but clearly positive reaction), +/ - (difficult 
to detect), and - (no reaction).The PspA of all strains gave a positive reaction ivith rabbit 
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EXAMPLE 3 - Isolation of PspA and Truncated Forms 

Thereof, and Immunization Thereby 

PspA is attached to the pneumococcal surface through a choline binding site on PspA. 
This allows for successful procedures for the isolation of FL-PspA. PspA can be released from 
the surface of pneumococci by elution with 2 percent choline chloride (CC), or by growth in a 
chemically defined medium (CDM) containing 12 percent CC (CDM-CC) or medium in which 
the choline had been replaced by ethanolamine (CDM-ET). Since CDM-ET supernatants lack 
high concentrations of choline, the PspA released into them can be adsorbed to a choline (or 
choline analog) column and isolated by elution from the column with 2 percent choline chloride 
(CQ. 

This Example describes the ability to obtain PspA by these procedures, and the ability of 
PspA obtained by these procedures to elicit protection in mice against otherwise fatal 
pneumococcal sepsis. Native PspA from strains R36A, RX1, and WU2 was used because these 
strains have been used previously in studies of the ability of PspA to elicit protective immunity 
(jee, e.g. f Examples infra and supra) . The first MAbs to PspA were made against PspA from 
strain R36A and the first cloned fragments of PspA and PspA mutants came from strain Rxl. 
Strain Rxl was derived from strain R36A, which was in turn derived from the encapsulated type 
2 strain, D39. PspAs from these three strains appears to be indentical based on serologic and 
molecular weight analysis. Molecular studies have shown no differences in the pspA genes of 
strains D39, Rxl, and R36A. The third strain that provided PspA in this Example is the mouse 
virulent capsular type 3 strain WU2. Its PspA is highly cross-reactive with that from R36A and 
Rxl, and immunization with Rxl and D39 PspA can protect against otherwise fatal infections 
with strain WU2. 

5- pneumoniae 

Strains of S. pneumoniae used In this study have been described previously (T able & h Bacteria were 
grown in either Todd-Hewitt broth with 0.5 percent yeast extract (THY), or a chemically defined medium 
(CDM) described previously ^3_ Serial passage of stock cultures was avoided. Strains were maintained 
frozen in THY + 20 percent glycerol and cultured from a scraping of the frozen culture. 
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Recovery of PspA from pneumococri 

PspA is not found in the medium of growing pneumococci unless they have reached stationary phase 
and autolysis has commenced 3<\ To release PspA from pneumococci three procedures were used. In one 
approach were grow pneumococci in 100 ml of THY and collect the cells by centrifugation at mid-log phase. 
The pellet was washed three times in lactated Ringer's solution (Abbot Lab. North Chicago, IL). suspended in a 
small volume of 2 percent choline chloride in phosphate buffered saline (PBS) (pH 7.0), incubated for 10 
minutes at room temperature, and centrifuged to remove the whole pneumococci. From immunoblots with anti- 
PspA MAb XiI26 48 at serial dilutions of the original culture, the suspended pellet, and the supernatant, it was 
evident that this procedure released about half of the PspA originally present on the pneumococci. Analysis of 
silver stained polyacrylamide gels showed this supernatant to contain proteins in addition to PspA 3 *\ 

The CDM used in the remaining two procedures was modified from that of Van der Rijn 43 . For normal 
growth it contained 0.03% CC To cause PspA to be released during bacterial growth, the pneumococci were 
grown in CDM containing 1.2 percent choline chloride (CDM-CQ, or in CDM containing 0-03 percent 
cthanolamine and only 0.000 t 001 percent choline (CDM-ET). In media lacking a normal concentration of 
choline the F-antigen and C-polysaccharide contain phosphoethanolamine rather than phosphocholine 49 . In 
CDM-CC and CDM-ET, PspA is released from the pneumococcal surface because of its inability to bind to the 
cholines in the lipoteichoic acids 36 . In addition to releasing PspA from the pneumococcal surface, growth in 
CDM-CC or CDM-ET facilitates PspA isolation by its other effects on the cell wall. In these media 
pneumococci do not autolyse 49 , thus permitting them to be grown into stationary phase to maximize the yield 
of PspA. In these media scptation does not occur and the pneumococci grow in long chains 36 * 49 . As the 
Pyococci reach stationary phase fccy d,v. cease making ftpA, Md rapidJy setUe 0UL ^ 
usu-g senal dilution dot blots to quantity ftpA. indicated that the production of PspA ceases at about the time 
the pneumococci begin ,o settle out, with the formation of visible strands of the condensed pneumococcal 
chains. When the pneumococci began to settle ow, the medium was recovered by centrifugation a, 2900 x g 
for 20 minutes, and filtered with a low protein-binding filter (.45 M Nalgene Tissue Culture Rlter * ,58-0045) 
For grow* in CDM-CC or CDM-ET, the pneumococci were first adapted to the defined medium and 
then, m the case of CDM-ET, to very low choline concentrations. To do this, strains were first inoculated into 1 
pan of THY and 9 parts of CDM medium containing 0.03 percent choline and 0.03 percent ethanolamine. 
After two subsequent subcultures in CDM containing 0.03 percent choline and 0.03 percent ethanolamine (0.1 
ml of culture + 0.9 ml of pre-warmed fresh medium), the culture was used to inoculate CDM with only 0 003 
percent choline (and 0.03 percent ethanolamine). These steps was repeated until the strain would grow in 
CDM-ET containing 0.000.001 percent choline and 0.03 percent examine. It was critical mat cultures be 
passed whtle in exponential growth phase (a, about 10? CFU/mi). Even trace contamination of the medium by 
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exogenous choline resulted in the failure of the PspA to be released from the pneumococcal surface 3* Thus, 
disposable plastic ware was used for the preparation of CDM-ET media and for growth of cultures. Once a 
strain was adapted to CDM-ET it was frozen in 80 percent CDM-ET and 20 percent glycerol at -SO 0 C When 
grown subsequently the strain was inoculated directly into the CDM-ET. 

Isolation of native (full-length) PspA 

PspA was isolated from the medium of cells grown in CDM-ET using choline-Sepharose prepared by 
conjugating choline to epoxy-activated Sepharose 50. a separate column was used for media from different 
strains to avoid cross-contamination of their different PspAs. For isolation of PspA from clarified CDM-ET, we 
used a 0.6 ml bed volume of choline-Sepharose. The column bed was about 0.5 cm high and 1 .4 cm in diameter. 
The flow rate during loading and washing was approximately 3 ml/min. After loading 300 ml CDM-ET 
supernatant, the column was washed 10 times with 3 ml volumes of SO mM Tris acetate buffer. pH 6.9 
containing 0.25 M NaCl (TAB). The washed column was eluted with sequential 3 ml volumes of 2 percent CC 
in TAB. Protein eluted from the column was measured (Bio-Rad protein assay, Bio-Rad, Hercules. CA). The 
column was monitored by quantitative dot blot. The loading material, washes, and the eluted material were dot 
blotted (Ijil) as undiluted, 1/4, 1/16. 1/64, 1/256, and 1/1024 on nitrocellulose. The membranes were then 
blocked with 1 percent BSA in PBS. incubated for 1 hr with PspA-specific MAbs Xil26 or XiR278, and 
developed with biotinylated goat-anti-mpuse Ig, alkaline phosphatase conjugated streptavidin (Southern 
Biotechnology Associates Inc. Birmingham, AL), and nitrobluetetrazolium substrate with 5-bromo 4-chIoro-3- 
indoyl phosphate p-toluidine salt (Fisher Scientific. Norcross GA) The purity of eluted PspA was assessed 

by silver-stained (silver stain kit. Bio Rad, Hercules. CA) SDS-PAGE gels run as described previously 32 . 

* 17 

Immune-blots of SDS-PAGE gels were developed wilh MAbs Xil26 and XiR278 - 

Isolation of 29 IcDa PspA 

The 29 kDa fragment comprising the N-terminaf 260 amino acids of PspA was produced in DH1 £ coli 
from pJY4306 An overnight culture of JY4306 was grown in 100 mi of Una Broth (LB) containing 

50ug/ml ampiciUin. The culture was grown at 37' C in a shaker at 225 rpm. This culture was used to inoculate 
6 one liter cultures that were grown under the same conditions. When the culture O-D. at 600 nm reached 0.7. 
12 grams of cells, as a wet paste, were harvested at 4° C at 12.000 xg. The pellet was washed in 10 volumes of 
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25 mM Tris pH 7.7 at 0° C and suspended in 600 ml of 20% sucrose, 25 mM Tris pH 7.7 with 1 0 mM 
ethylenediamine tetraacetic acid (EDTA) for 10 minutes. The cells were pelleted by centrifugation (8000 xg) 
and rapidly suspended in 900 ml of I percent sucrose with 1 mM Pcfabloc SC hydrochloride (Bcehringer 
Mannheim Corp., Indianapolis, IN.) at 0° C. The suspension was pelleted at 8000 xg at 4° C for 15 minutes and 
the PspA-containing supernatant (periplasmic extract) 51 recovered. The recombinant PspA was precipitated 

from the periplasmic extract by 70 percent saturated ammonium sulfate overnight at 4° C The precipitated 
material was collected by centrifugation at 12,000 xg at 4° C for 30 minutes. The precipitated protein was 
resuspended in 35 ml of 20 mM histidine 1 percent sucrose at pH 6.6 (HSB). Insoluble materials were removed 
at 1,000 xg at 4° C for 10 minutes. The clarified material was dialyzed versus HSB, passed through a 02\xm 
filter and further purified on a 1 ml MonoQ HR 5/5 column (Pharmacia Biotech, Inc., Piscataway, N J) 
equilibrated with HSB. The clarified material was loaded on the column at 1 ml/min t and the column was 
washed with 10 column volumes of HSB. The column was then eluted with a gradient change to 5 mM NaCl 

per minute at a flow rate of 1 ml/min. As detected by immuno blot with Xil26, SDS-PAGE and absorbance, 
PspA eluted as a single peak at approximately 0.27 to 0 JO M NaCL By SDS-PAGE the material was 
approximately 90 percent pure. The yield from 6 liters of culture was 2 mg (Bio-Rad protein assay) of 
recombinant PspA. 

Growth of pneumococcl for challenge 

Mice were challenged with log-phase pneumococci grown in THY. For challenge, the pneumococci 
were diluted directly into lactated Ringer's without prior washing or centrifugation. To inject the desired 
numbers of pneumococci, their concentration in lactated Ringer's solution was adjusted to an O.D. of about 0.2 
at 420 nM (LKB Ultrospec III spectrophotometer). The number of pneumococci present was calculated at 
5xl0 8 CFU per ml / O* D. and confirmed by colony counts (on blood agar) of serial dilutions of the inoculum. 

Immunization, challenge, and bleeding of mice 

* 

CBA/CAHN/XID/J (CBA/N) and BALB/cByJ (BALB/c) mice were purchased from Jackson Laboratory 
Bar Harbor, ME. Mice were given two injections two weeks apart and challenged i.v. two weeks later. 
Injections without CFA were given intrapertioneally in a 0.1 ml of Ringers. Where indicated, the first injection 
was given in complete Freund's adjuvant (CFA) consisting of approximately a 1:1 emulsion of antigen solution 
and CFA oil (Difco, Detroit MI). Antigen in CFA was injected inguinally in 0.2 ml divided between the two 
hind legs. All mice were boosted i.p. without adjuvant When mice were injected with media supernatants or 2 
percent choline chloride eluates of whole bacteria, the amounts of material injected were expressed as the 
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volume of media from which the injected material was derived. For example, if the clarified medium from 
pneumococci grown in CDM-CC or CDM-ET was used for immunization without dilution or concentration, the 
dose was described as 100(il. If the material was first diluted 1/10, or concentrated 10 fold, the dose was 
referred to as 10 or 1000 \il respectively. 

EUSA for antibodies to PspA 

Specific modifications of previously reported EUSA conditions ^» are described. Microtitration plates 
(Nunc Maxisorp, P.G.C Scientific, Gaithersburg MD.) were coated with undiluted supematants of Rxl and 
WG44.1 pneumococci grown in CDM-ET or 1 percent BSA in PBS. Mice were bled retro-orbitally (75 uJ) in a 
heparanized capillary tube (Fisher Scientific, Fair Lawn, N J.) The blood was immediately diluted in 0 J ml of 
one percent bovine serum albumin in PBS. The dilution of the resultant sera was 1/15 based on an average 
hematocrit of 47 percent The sera were diluted in 7 three fold dilution in microtitration wells starting at 1/45. 
Mab Xil26 was used as a positive control The maximum reproducible O. D. observed with X1126 was defined 
as "maximum O. D." The O. D. observed in the absence of immune sera or MAb was defined as "minimum O D." 
Antibody titers were defined as the dilution that gives 33 percent of maximum O. D. The binding to the Rxl 
CDM-ET coated plates was shown to be PspA-specific, since in no case did we observe £33 percent of 
maximum binding of immune sera or Xil26 on plates coated with WG44. 1 CDM-ET or BSA. 

Statistical analysis. Unless otherwise indicated P values refer to comparisons using the Wilcoxin two-sample 
rank test to compare the numbers of days to death in different groups. Mice alive at 2 1 days were assigned a 
value of 22 for the sake of calculation. P values of >0.05 have been regarded as not significant. Since we have 
never observed immunization with PspA or other antigens to make pneumococci more susceptible to infection 
the P values have been calculated as single tailed tests. To determine what the P value would have been if a 
two tailed test had been used the values given should be multiplied by two. In some cases P values were given 

for comparisons of alive versus dead. These were always calculated using the Fisher exact test. All statistical 
calculations were carried out on a Macintosh computer using InStat (San Diego, CA). 
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FspA Is the major protection-elidring component released from pneumococd grown in CDM-ET or 
CDM-CC, or released from conventionally grown pneumococd by elution with 2% CC. 

PspA-containing preparations from pneumococd were able to protect mice from fatal sepsis following 
i.v. challenge with 3 x 10 3 (100 x LD50) capsular type 3 £ pneumoniae (TableS ). Comparable preparations 
from the strains unable to make PspA (WG44. ! and JYI 1 19). or unable to make full length PspA (LM34 and 
JY2I4I) were unable to elicit protection. Regardless of the method of isolation the minimum protective dose 
was derived from pneumococd grown in from 10 - 30 of medium. We also observed 9 that supernatant! of 
log phase pneumococd grown in normal THY or CDM media could not elicit protection (data not shown). This 
finding is consistent with earlier studies 36 * 37 indicating the PspA is not normally released in quantity into the 
medium of growing pneumococd. 



Isolated PspA can elidt protection against fatal infection 

Although PspA was necessary for these preparations to elicit protection it was possible that it did not ac; 
alone. Mice were thus, immunized with purified FL-Psp A to address this question. 

Isolation ofFL-PspAJrom CDM-ET growth medium. We isolated the FL-PspA from CDM-ET rather 
than from CDM-CC medium or a 2 percent choline chloride elution of live cells, because the high levels of 
choline present in the latter solutions prevents adsorption of the PspA to the choline residues on the choline- 
Sepharosc column. PspA for immunization was isolated from strain R36A, as the strain is non-encapsulated and 
the isolated PspA could not be contaminated with capsular polysaccharide. As a control we have conducted 
mock isolations from WG44.1 since this strain has an inactivated pspA gene and produces no PspA. The results 
shown in ^^-Oare typical of isolations from 300 ml of CDM-ET medium from R36A grown pneumococd. 
We isolated 84 jig of PspA from 300 ml of medium, or about 280 jig/liter. Based on the dot blot results this 
appears to be about 15% of the PspA in the original medium; and that CDM-ET from R36A cultures contains 
about 400 ng/liter of PspA. or about 0.4 ng/ml. 

No serologically detectable-PspA was seen in the CDM-ET from WG44. 1 cultures. More significantly 
there was undetectable protein recovered from the choline-Sepharose column after adsorption of CDM-ET from 
a WG44.1 culture, indicating that PspA is the only protein that could be isolated by this procedure. Moreover 
by silver stained SDS PAGE gel the PspA isolated from R36A appeared to be homogenous ( Figure, 3 

Although autolysin can also be isolated on choline-Sepharose 20 - 50 . we did not expect it to be isolated by this 
procedure since autolysin is not released from pneumococct grown in choline deficient medium 36 . The 
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immunologic purity of the isolated PspA was emphasized by the fact that immunization with it did not elicit any 
antibodies detectable on plates coated with CDM-ET supernatant* of WG44.L 

Loading more than 300 ml on the 0.6 ml bed volume column did not result in an increased yield, which 
suggested that the column capacity had been reached. However, increasing the depth of the choline- 
Sepharose bed to greater than 05 cm* decreased the amount of PspA eluted from the column, presumably 
because of non-specific trapping of aggregates in the column matrix. The elution buffer contains 50 mM Tris 
acetate 025 M Nad and 2% choline chloride. Elution without added NaCl or with 1M NaCl resulted in lower 
yields. Elution with less than 1% CC also reduced yields. 

Immunization of mice wish purified R36A PspA. For immunization we used only the first 3 ml fraction 
of the R36A column. Mice were immunized with two injections of 1, 0.1, or 0.01 \i% of R36A PspA, spaced two 
weeks apart As controls, some mice were inoculated with a comparable dilutions of the first 3 ml fraction from 
the WG44. 1 column. Purified FL-PspA elicited antibody to PspA at all doses regardless of whether CFA was 
used as an adjuvant (Table 12). In the absence of CFA the highest levels of antibody were seen with the t \ig 
dose of PspA. In the presence of CFA, however, the 0.1 jig dose was as immunogenic as the 1 fig dose. 

To test the ability of the different doses of PspA to elicit protection against challenge we infected the 
immunized mice with two capsular type 3 strains, WU2 and A66. Although both of these strains are able to kill 
highly susceptible CBA/N XID mice at challenge doses of less than 10 2 , the A66 strain is several logs more 
virulent when BALB/c mice are used 52 . The difference in virulence of A66 and WU2, was partially 
compensated for by challenging the immunized CBA/N mice with lower doses of strain A66 than WU2. 

After immunization of CBA/N mice with 1 and 0. 1 jig doses of PspA we observed protection against 
WU2 challenge regardless of whether or not CFA was used as an adjuvant (Table 4). At the lowest dose, 0.01 
fig PspA, most of the mice immunized with PspA + CFA lived whereas most immunized with PspA alone did 
not: however, the difference was not statistically significant When immunized mice were challenged with the 
more virulent strain A66 ^ survivors were only observed among mice immunized with the 1 and 0.1 fig 
doses. There was slightly, more protection against fatal A66 infection among mice immunized with CFA than 
without, but the difference was not statistically significant When the two sample rank test was used to analyze 
the time to death of mice infected with A66 we observed a statistically significant delay in the time to death in 
each immunized group as compared to the pooled controls. 
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The 29 kDa N-terminal fragment of PspA can elicit protection against infection when injected with CFA 

We have compared the immunogenicity. with and without CFA, of an isolated 29 kDa fragment 
composed of the first 260 amino acids of PspA. Unlike the case with FL-PspA. adjuvant was required for the 29 
kDa fragment to elicit a protective response. This was observed even though the immunizing doses of the 29 
kDa antigen used were 10 and 30jig/mouse, or about 100 and 300 times the minimum dose of FL-PspA that can 
elicits protection in the absence of adjuvant 

Iitf ection with CFA revealed the presence of additional protection eliciting antigen(s) in CDM-CC, and 
CDM-ET growth medium but not in the 2 percent choline chloride eluates of five cells 

The observation that Freund's adjuvant could have such a major effect on the immunogenicity of the 29 
kDa fragment (T Mety, prompted us to reexamine the immunogens described in Table 2 to determine if 
immunization with adjuvant might enhance protection elicited by PspA-coniaining preparations or provide 
evidence for protection eliciting antigens in addition to PspA. By using CFA with the primary injection, the 
dose of PspA-containing growth medium (CDM-CC and CDM-ET) required to elicit protection was reduced 
from 10 - 30 |U (Table?) down to 1 to 3 jil (Tabl$2). When CFA was used as an adjuvant with CDM-CC and 
CDM-ET from PspA" strains WG44.1 and JY1 1 19 we were able to elicit protective immune responses if material 
from £100jU or more of media were injected- Thus, although there were apparently some protection eliciting 
components other than PspA in CDC-CC and CDM-ET growth media. PspA remained the major protection 
eliciting component even in the presence of adjuvant 

One of the media used for injection was CDM-ET in which JY2I41 had been grown. This medium 
elicited protection against WU2 challenge even when injected at doses as low as 1 \A. It should be noted that 
although this strain does not make full-length PspA, it secretes a truncated molecule comprising the fust 1 15 
amino acids of PspA into the growth medium. Thus, unlike CDM-ET fiora WG44.1 and JY1 1 19. CDM-ET from 
JY2141 has the potential to elicit PspA-specific immunity. In contrast to these results, the material eluted from 
JY214 1 with 2 percent CC was relatively non-immunogenic even when emulsified with CFA. This result is 
consistent with the fact that the U5 amino acid N-terminal PspA fragment of JY2141 is not surface attached 37 . 
and would be expected to be washed away prior to the elution with 2 percent CC 
Extension of studies to BALB/c mice and i.p. challenge route 

The studies above all involve i.v. challenge of CBA/N mice expressing with the XID genetic defect The 
i.v. route, used in the present studies provides a relevant model for bacteremia and sepsis, but pneumococci 
have higher LDtfs when injected i.v. than i.p. CBA/N mice are hypersusceptible to pneumococcal infection 
because of the XID defect. This genetic defect prevents them from having circulating naturally occurring 
antibody to phosphocholine. The absence of these antibodies has been shown to make XID mice several logs 
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immunologic purity of the isolated PspA was emphasized by the fact that immunization with it did not elicit any 
antibodies detectable on plates coated with CDM-ET supematants of WG44.1. 

Loading more than 300 ml on the 0.6 ml bed volume column did not result in an increased yield, which 
suggested that the column capacity had been reached. However, increasing the depth of the choline- 
Sepharose bed to greater than 0.5 cm. decreased the amount of PspA eluted from the column, presumably 
because of non-specific trapping of aggregates in the column matrix. The elution buffer contains 50 mM Tris 
acetate 0.25 M NaCl and 2% choline chloride. Elution without added NaCl or with 1M NaCl resulted in lower 
yields. Elution with less than 1 % CC also reduced yields. 

Immunization of mice with purified R36A PspA. For immunization we used only the first 3 ml fraction 
of the R36A column. Mice were immunized with two injections of 1. 0.1. or 0.01 \l% of R36A PspA. spaced two 
weeks apart. As controls, some mice were inoculated with a comparable dilutions of the first 3 ml fraction from 
the WG44. 1 column. Purified FL-PspA elicited antibody to PspA at all doses regardless of whether CPA was 
used as an adjuvant (Table H). In the absence of CPA the highest levels of antibody were seen with the 1 jig 
dose of PspA. In the presence of CFA. however, the 0.1 ^g dose was as immunogenic as the 1 jig dose. 

To test the ability of the different doses of PspA to elicit protection against challenge we infected the 
immunized mice with two capsular type 3 strains. WU2 and A66. Although both of these strains are able to kill 
highly susceptible CBA/N XID mice at challenge doses of less than 10 2 , the A66 strain is several logs more 
virulent when BALB/c mice are used 47 » 52 . The difference in virulence of A66 and WU2, was partially 
compensated for by challenging the immunized CBA/N mice with lower doses of strain A66 than WU2. 

After immunization of CBA/N mice with 1 and 0. 1 jig doses of PspA we observed protection against 
WU2 challenge regardless of whether or not CFA was used as an adjuvant (Table 4). At the lowest dose. 0.0 1 
fig PspA. most of the mice immunized with PspA + CFA lived whereas most immunized with PspA alone did 
not; however, the difference was not statistically significant When immunized mice were challenged with the 
more virulent strain A66 53 i survivors were only observed among mice immunized with the I and 0.1 Jig 
doses. There was slightly, more protection against fatal A66 infection among mice immunized with CFA than 
without, but the difference was not statistically significant. When the two sample rank test was used to analyze 
the time to death of mice infected with A66 we observed a statistically significant delay in the time to death in 
each immunized group as compared to the pooled controls. 
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more susceptible lo pneumococci than isogenic mice lacking the immune defect. From the data in TabUl4it is 
clear, however, that immunization with PspA can protect against infection in mice lacking the XID defect even 
when the challenge is by the i.p. route. Thus, there is no reason to suspect that the results presented are 
necessarily dependent on the use of the CB A/N XID mouse or the i.v. route. 

PspA Is highly immunogenic 

These studies provide the first quantitative data on the amount of purified FL-PspA that is required to 
elicit protective immunity in mice. The isolated PspA for these studies was obtained by taking advantage of the 
fact that the C-terminal half of PspA binds to cell surface choline 3 ^. The isolated FL-PspA was found to be 
highly immunogenic in the mouse. Only two injections of 100 ng of PspA in the absence of adjuvant were 
required to elicit protection against otherwise fatal sepsis with greater than 100 LD50 of capsular type 3 S. 
pneumoniae. When the first injection was given with adjuvant, doses as small as 10 ng could elicit protective 

response. The potent immunogenicity of PspA, and the ability to isolate it on choIine-Sehparose 
columns provides a demonstration for the possible use of PspA as a vaccine in humans. 

A large body of published 17 ' 29* 37 ^ wcU ^ ^^1^ evidence indicates that the major protection 
eliciting epitopes of PspA are located in the a-helical (N-terrainal) half of the molecule. From the present 
studies, it is clear that immunization with N-terminal fragments containing the first 1 15 or 260 of the 288 amino 
acid a-helicai region are able to elicit protection when given with CFA However, these fragment were not able 
to elicit protective responses without CFA. In the case of the both the 1 15 and 260 amino acid fragments, even 
immunization at 100 times the minimum dose that is immunogenic for FL-PspA failed to elicit a protective 
response. This result is consistent with previous results showing that a fragment composed of the N-terminal 
245 amino acids 3 *• 37 could elicit protection against otherwise fatal pneumococcal infection of mice when the 
immunization was given with CFA 32 . In that study no immunization without CFA was attempted. Even 
though the C-tenninal half of PspA may not contain major protection-eliciting epitopes it appears to contain 
sequence important in the immunogenicity of the molecule as a whole, since the full length molecule elicited 
much greater protection than the N-terminal fragments: The effect of the C terminal half on antigenicity may be 
in p*rt that it doubles the size of the immunogen. Molecules containing the C-terminal half of PspA may also 
be especially immunogenic because they exhibit more extensive aggregation than is seen with fragments 
expressing only the a-helical region 38 . Protein aggregates are known to generally be more antigenic and less 
tolerogenic than individual free molecules 54 . 
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PspA Is the major protection eliciting component of our pneumococcal extracts 

Evidence that PspA is the major protection eliciting component of the CDM-ET. CDM-CC growth media 
and the two percent CC eluates was dependent on the use of mutant pneumococci that lacked the ability to 
produce FL-PspA. More than one pspA mutant strain was used to insure that the failure to elicit protection in 
the absence of FL-PspA was not a spurious result of non-PspA mutation blocking the production of some other 
antigen- Strains WG44.1 and JYll 19 contain identical deletions that include the 5' end of the pspA genes and 
extend about 3 kb upstream of pspA 37 . WG44.1 is a mutant of the non-encapsulated strain Rxl and JYI 1 19 
was made by transforming capsular type 3 strain WU2 with the WG44.1 pspA mutation. In no case were 
preparations from WG44. 1 and JYI 119 as efficient at eliciting protection as those from the PspA+ strains. To 
rale out the possibility that protection elicited by preparations from the PspA + strains was elicited by some non- 
PspA molecule also encoded by a 3 kb deletion linked to the mutant pspA genes of WG44. 1 and JYI 1 19, we 
also used strains JY2I41 and LM34 26 » 37 . In these strains the Rxl pspA gene has been insenionally 
inactivated causing the production of N-terminal fragments of 1 15 and 245 amino acids respectively. These 
strains have no other known mutations. Although Rxl and R36A are closely related non-encapsulated strains, 
some of the studies included Rxl as the PspA+ control since it is the isogenic partner to WG44.1. LM34, and 
JY2141. The N terminal fragments produced by JY2I41 and LM34 lack the surface anchor and are secreted 
into the medium 36 . Two percent CC eluates of JY2141 were non-protection eliciting even in the presence of 
adjuvant. In the absence of adjuvant, CDM-ET from JY2141 was not protection-eliciting. LM34 was tested 
without CFA in only 3 mice, but gave results consistent with those obtained with JY2141. 

Anticapsular antibodies are known to be protective against pneumococcal infection 5 ' 19 . However, in 
these studies it is unlikely that they account for any of the protection we attributed to PspA. Our challenge 
strain bore the type 3 capsular polysaccharide and our primary source of PspA was strain R36 A. which is a 
spontaneous non-encapsulated mutant of a capsular type 2 strain 39 ' 41 . The R36A strain has been recently 
demonstrated to lack detectable type 3 capsule on the surface or in its cytoplasm 5S . Furthermore, the CBA/N 
mice used in most of the studies are unable to make antibody responses to capsular type 3 polysaccharide 56. 

Non-PspA protection eliciting components 

The observation that CDM-CC and CDM-ET supernatants of WG44.1 could elicit protection when 
injected in large amounts with adjuvant, suggested that these supernatants contained at least trace amounts of 
non-PspA protection eliciting molecules. In the case of preparations containing PspA cluted from the surface of 
live washed pneumococci with 2 percent CC there was no evidence for any protection eliciting components 
other than PspA, presumably because the protection-eliciting non-PspA proteins released into the media were 
removed by the previous washing step. The identity of the protection eliciting molecules in the WG44.1 
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supernatant are unknown. In this regard, it is of interest that unlike R36A, strain Rxl has been shown to 
contain a very small amount of cytoplasmic type 3 polysaccharide (but totally lacks surface type 3 
polysaccharide^) . This difference from Rxl apparently came about through genetic manipulations in the 
construction of Rxl fromR36A 39 * 41 . Thus, preparations made from Rxl or from its daughter strains WG44.K 
LM34, or JY2141 could potentially contain small amounts of capsular polysaccharide. For a number of reasons 
however, it seems very unlikely that the non-PspA protection-eliciting material identified in these studies was 
type 3 capsular polysaccharide (expressed by the WU2 challenge strain: 1) growth of these strains was either 
in CDM-CC or CDM-ET, each of which prevent autolysin activity and lysis 37 that would be required to 
release the small amount of type 3 polysaccharide from the cytoplasm of the Rxl family of strains; 2) CBA/N 
mice made protective responses to the non-PspA antigens, but express the XID immune response deficiency 
which permits responses to proteins, but blocks antibody to most polysaccharides 46, including type 3 capsular 
polysaccharide and 3) immunogerucity of the non-PspA component required CFA. an adjuvant known to 
stimulate T-dependent (protein) rather than T-independent (polysaccharide) antibody responses. 

A number of non-PspA protection eliciting pneumococcal proteins have been identified: pneumolysis 
autolysis neuraminidase, and PsaA which are 52, 36*5, 107 and 37 kDa respectively 2 1 , 58, 59. W The non* 
PspA protection eliciting components reported here could be composed of a mixture of these and/or other non- 
identified proteins. Attempts to identify lambda clones producing non-PspA protection eliciting proteins as 
efficacious as PspA have not been successful'^ . 

Isolation of PspA 

The protective capacity of the CDM-CC, CDM-ET and material eiuted from live cells with 2% CC were 
similar in terms of the volume of the original culture from which the injected dose was derived. The major 
advantage of eluting the PspA from the surface of pneumococci with 2 percent CC is that the pneumococci 
may be grown in any standard growth medium, and do not have to be first adapted to a defined medium. 
Moreover, concentration of PspA can be accomplished by centrifugation of the pneumococci prior to the 
elution of the PspA. An advantage of using either CDM-CC and CDM-ET media was that these media 
prevented lysis and pneumococci could be grown into stationary phase without contaminating the 
preparations with cytoplasmic contents and membrane and wall components. A particular advantage of CDM- 
ET growth medium is that since it lacks high concentrations of choline the PspA contained in it can be 
adsorbed directly to a choline-Sepharose column for affinity purification. 

One liter of CDM-ET growth medium contains about 400 fig of PspA. and we were able to isolate about 
3/4 of it to very high purity. At 0. 1 jxg/dose, a liter of CDM-ET contains enough PspA to immunize about 4,000 
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mice; or possibly 40 - 400 humans. Our present batch size for a single column ran is only 300 ml of CDM-ET. 
This couid presumably be increased by increasing the amount of the adsorbent surface by increasing the 
diameter of the column. Using our present running buffer we have found that a choline-Sepharose resin depth 
of 05 cm was optimal; increases beyond 0.5 cm caused the overall yield to decrease rather than increase, even 
in the presence of larger loading volumes of R36A GDM-ET 
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Table 8 Pneumococcal Strains 



Strain 


Capsule 


PspA 


Parent 


Construction 


References 




type 


expressed 


strain 


technique 




D39 


2 


full length 




clinical isolate 


26,44 


R36A 


non- 
encap- 
sulated 


full length 


D39 


non-encapsulated 
mutant 


23. 44, 45 


Rxl 


non- 
sulated 


full length 


R36A 


derived from R36 A 


26,39,41 


WG44.1 


non- 
encap- 
sulated 


none 


Rxl 


aberrant insertion 
inactivation with 
pKSD300 


26,37 


LM34 


non- 
encap- 
sulated 


aa 1-245 of Rxl » 


Rxl 


insertional 
inactivation with 
pKSD300 


26 t 37, 42 


JY2141 


non- 
encap- 
sulated 


aa 1-115 of Rxl* 


Rxl 


insertional 
inactivation with 
pJY4208 


37 


WU2 


3 


full length 




clinical isolate 


25, 46 


JYII19 


3 


none 


WU2 


transformation with 
WG44.1 DNA 


37 


A66 


3 


full length 




clinical isolate 


44.47 



a LM34 and LY2141 express fragments containing the first 245 and first 1 15 amino acids of Rxl PspA 
respectively. 
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Table 9 PspA Is the major protection-eliciting 
component in antigen preparations made by three 
different methods 





Strain 


Dose as 


Median Alive: 


P versus 


Prenaration 


(PspA 


volume 


Davs 


Dead 


controls** 


status) 


of media 


Alive 










inul* 

ill 










R36A 


1000 


>21 


2:0 




dusieftom 


1 % 9 


200 


>21 


2:0 




liveccIU 


20 


>21 


2:0 








2 


1.5 


0:2 








ill R36A 


>zi 


0 • * 


A M 

vJDJ 




JV2UI 


f|UU| 

1000 










<ui-U5> 


200 


J 


0:2 






20 


1 


0:2 




CDM-CC 


Rxt 


too 


>21 


9:0 


<OJQO0\ 


deified 


(PijA*) 


30 


>21 


2:1 




medium 




10 




1 :2 








3 


2 


0:3 








ALL 


2>21 


12:6 


0X004 




LM34 


100 


2.2*>2I 


1:2 






WG44.1 


100 


2 


0:9 






(PipA") 


30 


2 


0:3 






10 


2 


0 : 3 








4 




V • J 






WU2 


1000 


>21 


3:0 


0.05 




(PjpA+) 


too 


>2l 


1:0 


0.03 




ALL 


>21 


4:0 




JYU19 


1000 


4 


0:3 

* 






CftpA") 












CDM-CC 


100 


2 


0:1 




CDM-ET 


R36A 


100 


>21 


8:0 


<0.0001 






10 


3.>2l 


5:5 


0.004 




I 


1J 


3:5 








0.1 


2 


0:2 








ALL 


>21 


16:12 


0.006 




JY2141 


100 


M 


0:2 






(ai 1-115) 


10 


LS 


0:2 






WG44.1 


100 


a 


0:2 






(PipA") 


10 


u 


0:2 




None 






2 


0:14 





Antigen aose is given as me volume or groww mewa own wmra ws w.i «u» "T • 

Each mouse was injected twice i.p. with the indicated dose diluted as necessary in lactated Ringer's injection 



b Controls used for statistical comparisons: 2% CC,aIl JY2141; CDM-CC Rxl, all WG44.1; CDM-CC WU2. 
JYI119; CDM-ET, all WG44.1 + all JY2141. 
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TablelO Isolation of PspA from 300ml of CDM-ET media after the growth of R36A or WG44.1 
pneumococci* 



R36A WG44.1 



fraction 


protein/ml 


total u\g 
protein© 


max. 
reciprocal 
dotb!ot c 


total dot 

blot 
units©, d 


\ig protein 
pet/ml 


total \ig 

protein'' 


max. 
reciprocal 

dot blot c 


growth media 


133 


3.990 


4 


1200 


13.7 


4,110 


<1 


fall-through 


13.6 


4,080 


I 


300 


13.5 


4.050 


<l 


1st wash 






<1 








<i 


10th wash 






<1 








<i 


elution #1 


26 


78 


256 


770 


<1 




<i 


elution #2 


2 


6 


16 


48 


<1 




<i 


elution #3 


<1 




4 


12 


<1 




<i 


total eluted 




84 . 




830 






<i 



a The columns were loaded with 300 ml of clarified CDM-ET medium after the growth of R36A or WG44. 1 . 

The column was washed with 10 sequential 3 ml fractions of TBA. Elution was with TBA plus 2 percent CC 
0 Total jig protein or total dot blot units reflect the total protein in the 300 ml of the loading material or the 3 m! 

size of the eluted fractions. 
c MAb XiR278 was used in the immunoblots to detect PspA in dot blots. 
d Dot blot units were calculated as the reciprocal dot blot titer times the volume in ml. 
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Table II Purified full-length PspA is able to elicit protection against fatal sepsis in mice. 



Antigen 


Dose* 


Adjuvant 
Diluent 


And* rspA 

titer* 
ft j\t mean 

tS*E) 




Challenge with 
10 s - 1 WU2 




Challenge with 
lO^Aft* 


Alive 
Dad 


Median 
Alive 


P vs. 

Duntu 

control 


Ded 


Median 
i/ajr* 
Alive 


P u. 
controls 0 


RJ6A 


'MS 


Ringer's 


3 J ± 0.2 


5:0 


>21 


0.015 


2:3 


4 


0.002 


(PspA* } 


0.1 


Ringcr'i 


2.6 ±0.2 


4:0 


>21 


0.041 


1:4 


4 


0.0032 


0.01 


Ringer's 


2.7 ±03 


1:4 


4 


ils. 


0:5 


3 


0.0058 




I PS 


CFA 


3.3 ±04 


5:0 


>21 


0.027 


3:2 


>21 


OJ00I2 




0,1 


CFA 


3.6 ± 0.1 


3:0 


>21 


0013 


2:3 


4 


0.0012 




0.01 


CFA 


3.1 ±04 


4: 1 


>2l 


0,015 


0:5 


3 


0.0051 


WG44.1 


3600 jd 


Ringtr*t 


<1.6 




■uL 




1:4 


3 


ax 




360 


Ringer's 


<U 


o,d._ 


iLd. 




0:5 


2 


ILS. 


36 


Ringer's 


<U6 


iLd. 


D-d. 




0:5 


2 


n.s. 




3600 til 


CFA 


<1.6 


Dud* 


ILd. 




0:5 


2 


n.i. 




360 


CFA 


<1.6 


tuL 


ILd. 




1:4 


2 


ILS. 




36 


CFA 


<1.6 


nuL 


ltd. 




0:5 


2 


n.j. 


saline 




CFA 


<l-6 


I :5 


4 




lut 


iLd. 




pooled 






<1.6 


1:5 


4 




2:28 


2 





contrail 



a For comparison with the data in Table 2, it should be noted that the 1 v 0. 1. and 0.01 ug doses were derived 
from 3600, 360, and 36 jil of R36A growth media. Equivalent dilutions of the PspA* eluate from strain 
WG44.1 were injected as controls. The amount of the WG44.1 preparations injected is listed as 3600, 360, 
and 36 ul and corresponds to the volume original growth medium from which the doses of WG44.1 was 
prepared. 

b Antibody values were expressed as reciprocal ELI5A titer. 

C P values calculated by the Wilcoxon two sample rank test By Kruskal-Wallis nonparametric ANO VA for the 
WU2 challenge was significant at P=0.01, for A66 significance was at ?<0.000L 
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Table 12 The 29 kDa N-tenninal fragment of Rxl PspA must be injected 
with adjuvant to elicit protection against WU2 a 



fig 29 kDa 
PspA 


Adjuvant 
or diluent 


Median Days 
Alive 


Alive : Dead 


P versus 
none** 


30 


CPA 


>21 


3:0 


0.0006 


3 


CPA 


>21 


3:0 




30 


Ringer's 


2 


0:3 




3 


Ringer's 


2 


1:2 




none 


CPA 


2 


0:7 




none 


Ringer's 


,2 


0:7 





■The 29 kDa fragment comprises the first 260 amino acids of PspA. 

b For the calculation of P values the 30^g and 3 jig data were pooled; mice immunized with PspA + CFA were 
compared to CFA controls; mice immunized with PspA + Ringer's were compared to controls immunized with 
Ringer's. Only the statistically significant P values are shown. The calculated P value of PspA + CFA versus 
CFA alone, was 0.0006 by both the Wilcoxon two sample rank test and the Fisher exact test. 
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Tablel3 PspA is not the only protection eliciting molecule released 
from pneumococci by interference with binding to choline on the 
surface of pneumococci 



Pre miration 


Strain 
(PspA 
status) 


Dose 
(as volume 

in MO 


Median 
Day 
Alive 


Alive: 
Dead 


P values 3 












P vs. all JY2141 




IUWI 


tooo 


>2l 


2:0 


0.02 


duitcfhxn 




200 


>21 


5:0 


livecdls 


20 


>2l 


5:0 


0j02 








i *fl 


0 0? 








>21 


17:0 


0.001 




mui 


■ AAA 

1000 


>21 








m m mm mT^ 

(u 1*115) 


200 


■ 

-l 


A • *t 






20 


1 


0:2 








2 


1 


0:2 








mil JI7I4I 


1 


2:0 














P versus pooled conl- 


CDM-CC 


Rxl 


1000 


>21 


3:0 


0.002 


clarified 




too 


>21 


4 A 

3:0 


0.002 


+ 


WU2 


1000 


>21 


3:0 


0.002 


CFA 


(PipA+) 


100 


>21 


3 : 0 






3 


>2I 


3:0 


0.002 




WG44.1 


1000 


>2l 


5:1 


<0.0001 




(PipA") 


100 


15 


2:4 


0.002 




JYUWCPipA-) 


tooo 


>21 


3:0 


0.002 




100 


>2l 


3:0 


0.002 


CDM-ET 


R36A 


1000 


>2l 


3:1 


0.004 


dvified 


(PjpA+) 


10 


>21 


4:0 


0.004 




1 


>21 


3:1 


0.004 


+ CFA 




02 


2 


0:4. 






JY2141 


10 


>2i 


2:0 






(11 1-115) 


1 


>2t 


2:0 


0.004 








>21 


4:0 




WC44.I 


100 


>21 


2:0 






(PipA") 


10 


2 


0:2 




CDM-ET only 


+ CFA 




2 


0:9 




None 


none 




U 


0:4 




Fooled Controls* 




2 


0:13 





* In cases where there were not statistically significanfresults no P value was shown. 
b-Pooled Controls" refers to "CDM-ET only* Data and "None" data. 
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RttA(PtpA+) 
W044.1 (FipA-j 



Pooled L+ tad ipw 
remits 



IOOmI 



0-06J0.03 



OXOS/0.0007 



S « .J^n^^f was elated from 100UI of CDM-ET medium. As a control mice were injected 
WOUI S^ wS ° f choUn ^° lu ™ Cffl^t from a mock isolation of PspA from the PspA^in 
I^LSSSi « »00 Ml since to is me volume CDM-ET faftfcff" 

b fJh^.fn'^SHL^"" t r°T" mp,e ™* test. TSR, or Fisher exact. FE versus pooled controls for 
^jf^?^""^^ -WG44.P and^none". Thelp 

bol ieitfwe^ IS Eb"£2S T , Whe " ^ data from tw ° studi « were pooled the /> vaEeVby 
ootu tests were S0.008. In cases where there were not statistically significant results no P value was shown. 
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EXAMPLE 4 - Evidence For Simultaneous Expression of Two PspAs 

From Southern blot analysis there has been an issue as 
to whether most isolates of S. pneumoniae has two DNA sequences 
that hybridize with both 5' and 3' halves of Rxl pspA, or whether 
this is an artifact of Southern blot. When bacterial lysates 
have been examined by Western blot, the results have always been 
consistent with the production of a single PspA by each isolate* 
This Example provides evidence for the first time that two PspAs 
of different apparent molecular weights and different serotypes 
can be simultaneously expressed by the same isolate. 

Different PspAs frequently share cross-reactive 
epitopes, and an immune serum to one PspA was able to recognize 
PspAs on all pneumococci. In spite of these similarities, PspAs 
of different strains can generally be distinguished by their 
molecular weights and by their reactivity with a panel of PspA- 
specific monoclonal antibodies (MAbs) . 

A serotyping system for PspA has been developed which 
uses a panel of seven MAbs. PspA serotypes are designated based 
on the pattern of positive or negative reactivity in immunoblots 
with this panel of MAbs. Among a panel of 57 independent 
isolates of 9 capsular groups/types, 31 PspA serotypes were 
observed. The large diversity of PspA was substantiated in a 
subsequent study of 51 capsular serotype 6B isolates from Alaska, 
provided by Alan Parkinson at the Arctic Investigations 
Laboratory of the Centers for Disease Control and Prevention. 
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Among these 51 capsular type 6B isolates were observed 22 
different PspAs based on PspA serotype and molecular weight 
variations of PspA* 

While most pneumococcal strains appear to have two DNA 
sequences homologous with both the 5' and 3' halves of pspA, 
site-specific truncation mutations of Rxl have revealed that one 
these, pspA, encodes PspA. The other sequence has been 
provisionally designated as the pspA-like sequence. At present 
whether the pspA-like sequence makes a gene product is unknown. 
Evidence that the pspA and pspA-like genes are homologous but 
distinct groups of alleles comes from Southern blot analysis at 
high stringencies. Additional evidence that pspA and the pspA- 
like loci are distinct comes from studies using PCR primers that 
permit amplification of a single product approximately 2Kb in 
size from 70% of pneumococci. For the remaining 30% of 
pneumococci no amplification was observed with the primers used. 
Evidence for two PspAs : 

When the strains of MC25-28 were examined with the 
panel of seven MAbs specific for different PspA epitopes, all 
four demonstrated the same patterns of reactivity (Fig. 4). The 
MAbs xiR278 and 2A4 detected a PspA molecule with an apparent 
molecular weight of 190 KDa in each isolate. In accordance with 
the previous PspA serotyping system, the 190 KDa molecule was 
designated as PspA type 6 because of its reactivity with XiR278 
and 2A4, but none of the five other MAbs in the typing system. 
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Each isolate also produced a second PspA molecule with an 
apparent molecular weight 82 KDa. The 82 KDs PspA in each 
isolate was detected only with the MAb 7D2 and was designated as 
type 34. No reactivity was detected with MAbs Xil26, Xi64, 1A4, 
or SR4W4. The fact that all four capsular 6B strains exhibit two 
PspAs, based on both molecular weights and PspA serotypes, 
suggested that they might be members of the same clone. 
Simultaneous production of both PspAs : 

Results from the colony immunobloting showed that both 
PspAs were present simultaneously in each colony of these 
isolates when grown in vitro. All colonies on each plate of the 
original culture, as well as all of the progeny colonies from a 
single colony, reacted with MAbs XiR278 f 2A4 , and 7D2„ 
Number of pspA genes : 

One explanation for the second PspA molecule was that 
these strains contained an extra pspA gene. Since most strains 
contain a pspA gene and a pspA-like gene it was expected that if 
an extra gene were present one might observe at least three pspA 
homologous loci in isolates MC25-28. In Hind III digests of 
MC25-28 each strain revealed a 7.7 and 3.6 Kb band when probed 
with plSMpspA13/2 (Figure 5A) . In comparison, when Rxl DNA was 
digested with Hind III and hybridized with plSMpspA13 .2 , 
homologous sequences were detected on 9.1 and 4.2 Kb fragments as 
expected from previous studies (9) (Figure 5A) . Results 
consistent with only two pspA-homologous genes in MC25-28 were 
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also obtained with digestion using' four additional enzymes (Table 
15) . 

In previous studies it has been reported that probes 
for the 5' half of pspA (encoding the alpha-helical half of the 
protein) bind the pspA-like sequence of most strains only at a 
stringency of around 90%. With chromosomal digests of MC25-28 we 
observed that the 5' Rxl probe of pLSMpspA12/6 bound both pspA 
homologous bands at a stringency of greater than 95 percent. The 
same probe bound only the pspA containing fragment Rxl at a 
stringency above 95 percent (Figure 5B) . 

Further characterization of the pspA gene was done by 
RFLP analysis of PCR amplified pspA from each strain. Since 
previous studies indicated that individual strains yielded only 
one product, and since the amplification is carried out with 
primers based on a known pspA sequence, it seems likely that in 
each case the amplified products represent the pspA rather than 
the pspA-like gene. When MC25-28 were subjected to this 
procedure, an amplified pspA product of 2.1 Kb was produced in 
each case. When digested with Hha 1 digest the sum of the 
fragments obtained with each enzyme was approximately equal to 
the size of the 2.1 Kb amplified product (Figure 6). These 
results suggest that the 2.1 Kb amplified DNA represents the 
amplified product of only a single DNA sequence. Rxl, by 
comparison, produced an amplified product of 2.0 Kb and five 
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fragments of 0*76, 0.468, 0390, 0.349 and 0.120, when digested 
with Hha 1 as expected from its known pspA sequence. 

The four isolates examined in this Example are the 
first in which two PspAs have unambiguously been observed. The 
interpretation that two PspAs are simultaneously expressed by a 
single pneumococcal isolate is based on the observation that 
bands of different molecular weights were detected by different 
MAbs to PspA. Isolates used in this study were from a group 
originally selected for study by Brian Spratt because of their 
resistance to penicillin. It is very likely that all four of the 
isolates making two PspAs are related since they share PspA 
serotypes, amplified pspA RFLPs, chromosomal pspA RFLPs, capsule 
type, and resistance to penicillin. 

The interpretation of studies presented here, showing 
the existence of two PspAs in the four strains MC25-28, must be 
set in the context of what is know about the serology PspA as 
detected by Western blots. PspAs of different strains have been 
shown previously to exhibit apparent molecular weight sizes 
ranging from 60 to 200 KDa as detected by Western blots. At 
least part of this difference in size is attributable to 
secondary structure. Even for the PspA of a single isolate, band 
of several sizes are generally observed. Mutation and 
immunochemistry studies have demonstrated, however, that all of 
the different sized PspA band from Rxl are made by a single gene 
capable of encoding a 69 KDa protein. The heterogeneity of band 
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size on Western blots of PspA made by a single strain appears to 
be due to both degradation and polymerization* 

PspA was originally defined by reciprocal absorption 
studies demonstrating that a panel of MAbs to Rxl surface 
proteins each reacted with some protein and later by studies 
using Rxl and WU2 derivatives expressing various truncated forms 
of PspA, In both cases it was clear that each MAbs to the PspA 
of a given strain reacted with the same protein. Such detailed 
studies have not been done with each of the several hundred human 
isolates. It is possible that with some isolates, reactivity of 
the MAbs with two PspAs may have gone unnoticed. This could have 
happened if all reactive antibodies detected both PspAs of the 
same isolate, or if the most prominent migration bands from each 
of the two PspAs co-migrated. With isolates MC25-28 the 
observation of two PspAs was possible because clearly 
distinguishable bands of different molecular weights reacted 
preferentially with different MAbs. 

Applicants favor the interpretation that isolates MC25- 
28 each make two PspAs, because an alternative possibility, 
namely, that the 190 KDa PspA detected by MAbs XiR278 and 2A4 
might be a dimer of the 84 KDa monomer detected by MAb 7D2, if 
the epitopes recognized by the different MAbs were dependent on 
either the dimer ic or monomer ic status of the protein, seems 
unlikely since whenever MAbs react with the PspA of a strain, 
they usually detect both the monomeric and the dimer ic forms. No 
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other isolates have been observed where some MAbs detected only 
the apparent dimeric form of PspA while others detected only the 
monomer ic form. 

There could be several possible explanations for the 
failure to observe two PspAs produced by most strains, l) All 
pneumococci might make two pspAs in culture, but MAbs generally 
recognize only one of them (perhaps in this isolate there has 
been a recombination between pspa DNA and the pspA-like locus, 
thus allowing that locus to make a product detected by MAb to 
PspA) . 2) All pneumococci can have two pspAs but the expression 
of one of them generally does not occur under in vitro growth 
conditions. 3) The pspA-like locus is normally a nonfunctional 
pseudogene sequence that for an unexplained reason has become 
functional in these isolates. 

It seems unlikely that the expression of only a single 
PspA by most strains is the result of a phase shift that permits 
the expression of only the pspA or pspA-like gene at any one 
time, since many of the strains examined repeatedly and 
consistently produce the same PspA. In the case of strains MC25- 
28, the appearance of two PspAs is apparently not the result of a 
phase switch, since individual colonies produced both the type 6 
and the type 34 PspAs. 

Presumably in these four strains, the second PspA 
protein is produced by the pspA-like DNA sequence. At high 
stringency, the probe comprising the coding region of the alpha- 
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helical half of PspA recognized both pspA homologous sequences of 
MC25-28 but not the pspA-like sequence of Rxl. This finding 
indicates that the pspA-like sequence of MC25-28 is more similar 
to the Rxl pspA sequence than is the Rxl pspA-like sequence. If 
the pspA-like sequence of these strains is more similar to pspA 
than most pspA-like sequences , it could explain why we were able 
to see the products of pspA- like genes of these strains with our 
MAbs. The finding of two families of PspAs made in vivo by 
pneumococci, allows for use of the second PspA in compositions, 
as well as the use of DNA primers or probes for the second gene 
for more conclusive detecting, determining or isolating of 
pneumococci. 

Isolates and Bacterial Cell Culture : 

Pneumococcal isolates described in these studies were 
cultured from patients in Barcelona, Spain (one adult at 
Bellvitge Hospital, and three children at San Juan de Dios) 
between 1986 and 1988 (Table 2). These penicillin resistant 
pneumococci originally in the collection of Dr. Brian Spratt were 
shared with applicants by Dr. Alexander Tomasz at the Rockefeller 
Institute. Rxl is a rough pneumococcus used in previous studies, 
and it is the first isolate in which pspA was sequenced. 
Bacteria were grown in Todd-Hewitt broth with 0.5% yeast extract 
or on blood agar plates overnight in a candle jar. Capsular 
serotype was confirmed by cell agglutination using Danish 
antisera (Statens Seruminstitut, Copenhagen, Denmark) as 
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previously described. The isolates were subsequently typed as 6B 
by Quellung reaction, utilizing rabbit antisera against 6A or €B 
capsule antigen prepared by Dr. Barry Gray. 
Bacterial lvsates ; 

Cell lysates were prepared by incubating the bacterial 
cell pellet with 0.1% sodium deoxycholate, 0.01% sodium 
dedecylsulfate (SDS) , and 0.15 M sodium citrate, and then 
diluting the lysate in 0.5M Tris hydrochloride (pH 6.8) as 
previously described. Total pneumococcal protein in the lysates 
was guantitated by the bicinchonic acid method (BCA Protein Assay 
Reagent; Pierce Chemical Company, Rockford, IL) . 
PspA serotyping : 

Serotyping of PspA was performed according to 
previously published methods. Briefly, pneumococcal cell lysates 
were subjected to SDS-PAGE, transferred to nitrocellulose 
membranes, and developed as Western blots using a panel of seven 
MAbs to PspA. PspA serotypes were assigned based on the 
particular combination of MAbs with which each PspA was reactive. 
Colony Immunoblottina : 

A ten ml tube of Todd-Hewitt broth with 0.5% yeast 
extract was inoculated with overnight growth of MC23 from a blood 
agar plate. The isolate was allowed to grow to a concentration 
of 10 7 cells/ml as determined by an O.D. of 0.07 at 590nm. MC23 
was serially diluted and spread-plated on blood agar plates to 
give approximately 100 cells per plate. The plates were allowed 
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to grow overnight in a candle jar, -and a single block agar plate 
with well-defined colonies was selected- Four nitrocellulose 
membranes were consecutively placed on the plate. Each membrane 
was lightly weighted and left in place for 5 minutes, in order 
to investigate the possibility of phase-variation between the two 
proteins detected on Western blots a single colony was picked 
from the plate, resuspended in ringers, and spread-plated onto a 
blood agar plate. The membranes were developed as Western blots 
according to PspA serotyping methods. 
Chromosomal DNA Preparation : 

Pneumococcal chromosomal DNA was prepared as in Example 9. The 
cells were harvested, washed, lysed, and digested with 0.5% 
(wt/vol) SDS and 100/xg/ml proteinase K at 37°C for 1 hour. The 
cell wall debris, proteins, and polysccharides were complexed 
with l% hexadecyl trimethyl ammonium bromide (CTAB) and 0.7M 
sodium chloride at 65 °C for 20 minutes, then extracted with 
chloroform/ isoamyl alcohol. DNA was precipitated with 0.6 
volumes isopropanol, washed, and resuspended in lOmM Tris-HCL, 
ImM EDTA, pH 8.0. DNA concentration was determined by 
spectrophotometric analysis at 260nm. 
Probe preparation : 

5' and 3' oligonucleotide primers homologous with 
nucleotides 1 to 26 and 1967 to 1990 of Rxl pspA (LSM 13 and 
LSM2, respectively) were used to amplify the full length pspA and 
construct probe LSMpspA13/2 from Rxl genomic DNA. 5' and 3' 
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oligonucleotide primers homologous to nucleotides 161 to 187 and 
nucleotides 1093 to 1117 (LSM 12 and LSM 6, respectively) were 
used to amplify the variable alpha-helical region to construct 
probe LSMpspA12/6. PCR generated DNA was purified by Gene Clean 
(BiolOl Inc., Vista, CA) and random prime-labeled with 
digoxigenin-ll-dUTP using the Genius 1 Nonradioactive DNA 
Labeling and Detection Kit as described by the manufacturer 
(Boehringer Mannheim, Indianapolis, IN) . 
DNA electrophoresis : 

For Southern blot analysis, approximately 10/zg of 
chromosomal DNA was digested to completion with a single 
restriction endonuclease, (Hind III, Kpn 1, EcoR 1, Dra 1, or Pst 
1) then electrophoresed on a 0*7% agarose gel for 16-18 hours at 
35 volts. For PCR analysis, 5ul of product were incubated with a 
single restriction endonuclease, (Bel 1, BamH 1, Pst 1, Sac 1, 
EcoR 1 Sma 1, and Kpn 1) then electrophoresed on a 1.3% agrose 
gel for 2-3 hours at 90 volts. In both case, 1 Kb DNA ladder was 
used for molecular weight makers (BRL, Gaithersburg, MD) and gels 
were stained with ethidium bromide for 10 minutes and 
photographed with a ruler. 
Southern blot hybridization 

The DNA in the gel was depurinated in 0.25N HCL for 10 
minutes, denatured in 0.5M NaOH and 1.5M NsCl for 30 minutes, and 
neutralized in 0.5M Tric-HCl (pH 7.2), 1.5M NaCl and ImM disodium 
EDTA for 3 0 minutes. DNA was transferred to a nylon membrane 

80 



WO 97/09994 



PCT/US96/14819 



(Micron Separations INC, MA) using a POSIBLOT pressure blotter 
(Strategene, La Jolla, CA ) for 45 minutes and fixed by UV 
irradiation. The membranes were prehybridized for 3 hours at 
42 °C in 50% formamide, 5X SSC, 5X Denhardt solution, 25mM sodium 
phosphate (pH 6.5), 0-5% SDS 3% (wt/vol) dextran sulfate and 
500/xg/ml of denatured salmon containing 45% formamide, 5X SSC, IX 
Denhardt solution, 20mM sodium phosphate (pH 6.5), 0.5% SDS, 3% 
dextran sulfate, 250jig/ml denatured sheared salmon sperm DNA and 
about 20ng of heat-denatured diogoxigenin-labeled probe DNA. 
After hybridization, the membranes were washed twice in 0.1% SDS 
and 2X SSC for 3 minutes at room temperature. The membranes were 
washed twice to a final stringency of 0.1% SDS in 0.3X SSC at 
65 °C for 15 minutes. This procedure yields a stringency greater 
than 95 percent. The membranes were developed using the Genius 1 
Nonradioactive DNA Labeling and Detection Kit as described by the 
manufacturer (Boehringer Mannheim, Indianapolis, IN) - To perform 
additional hybridization with other probes, the membranes were 
stripped in 0.2N NaOH/0.1%SDS at 40°C for 30 minutes and then 
washed twice in 2X SSC. 
Polymerase Chain Reaction (PCR) : 

5' and 3' primers homologous with the DNA encoding the 
N- and C-terminal ends of PspA (LSM13 and LSM2, respectively) 
were used in these experiments. Amplifications were made using 
Tag DNA polymerase, MgCl 2 and 10X reaction buffer obtained from 
Promega (Madison, WI) . DNA used for PCR was prepared using the 

81 



WO 97/09994 



PCT/US96/14819 



method previously described in this paper. Reactions were 
conducted in 50ml volumes containing 0.2mM of each dNTP, and 1ml 
of each primer at a working concentration of 50mM. MgCl 2 was 
used at an optimal concentration of - 1.75mM with 0-25 units of Taq 
DNA polymerase. Ten to thirty ng of genomic DNA was added to 
each reaction tube. The amplification reactions were performed 
in a thermal cycler (M.J. Research, Inc.) using the following 
three step program. Step 1 consisted of a denaturing temperature 
of 94 °C for 2 minutes. Step 2 consisted of 9 complete cycles of 
a denaturing temperature of 94 °C for 1 minute, an annealing 
temperature of 50°C for 2 minutes, and an extension temperature 
of 72 °C for 3 minutes. Step 3 cycled for 19 times with a 
denaturing temperature of 94 °C for 1 minute, an annealing 
temperature of 60°C for 2 minutes, and an extension temperature 
of 72 °C for 3 minutes. At the end of the last cycle, the samples 
were held at 72 °C for 5 minutes to ensure complete extension. 
Band size estimation : 

Fragment sizes in the molecular weight standard and in 
the Southern blot hybridization patterns were calculated from 
migration distances. The standard molecular sizes were fitted to 
a logarithmic regression model using Cricket Graph (Cricket 
Software, Malvern, PA) . The molecular weights of the detected 
bands were estimated by entering the logarithmic line equation 
obtained by Cricket Graph into Microsoft Excel (Microsoft 
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Corporation, Redmond, WA) in order to calculate molecular weight 
based in migration distances observed in the Southern blot. 
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Table J£ 



Restriction 
Enzyme 



Hind III 
Kpn I 
EcoRI 
Dra I 
Pst I 



Strains Examined 
MC25 MC26 MC27 RX1 



+ 



+ 
+ 



Restriction Fragments 



RX1 


MC25 


-MC28 


RX1 


+ 


7.7, 


3.6 


9.1. 4.2 


+ 


11.6. 


10.6 


10.6. 9.8 


+ 


8,4, 


7.6 


7.8, 6.6 


+ 


2.1, 


1.1 


1.9, 0.9 




>14, 


6.1 


10.0, 4.0 



?3A 
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Table \£* Penicillin Resistant Capsular Serogroup 6 Strains from Spain 

Isolate Penicillin MIC fuo/mH Year Site Hospital 

MC25 1 1986 sputum Bellvitge 

MC26 4 1988 ear San Juan de Dios 

MC27 1 1988 ear San Juan de Dios 

MC26 2 1988 ? San Juan de Dios 
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EXAMPLE 5 - Southern blot anal ysis of p spAs and Fragments of ospA 

In this example, Applicants used oligonucleotides 
derived from the DNA sequence of pspA of S. pneumoniae Rxl both 
as hybridization probes and as primers in the polymerase chain 
reaction to investigate the genetic variation and conservation of 
the different regions of pspA and pspA-like sequences. The 
probes used ranged in size from 17 to 33 bases and included 
sequences representing the minus 35, the leader, the a-helical 
region, the proline-rich regions, the repeat regions, and the C- 
terminus. Applicants examined 18 different isolates representing 
12 capsular and 9 PspA serotypes. The proline-rich, repeat, and 
leader, regions were highly conserved among pspA and pspA-like 
sequence. 

In the previous Example, it was shown that strain Rxl 
and most other strains of S. pneumoniae had two homologous 
sequences that could hybridize with probes encoding the N 
terminal and C terminal halves of PspA. This conclusion that 
these were separate sequences was supported by the fact that no 
matter which restriction enzymes was used there were always at 
least two (generally two sometimes three or four) restriction 
fragments of Rxl and most other strains hybridized with the pspA 
probes. When the genome of Rxl was digested with Hindlll and 
hybridized with these, two pspA-homologous sequences were found 
to be in 4.0 and 9.1 kb fragments. Using derivative of Rxl which 
had insertion mutations in pspA, it was possible to determine 
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that the 4.0 kb fragment contained the functional pspA sequence. 
The pspA-homologous sequence included within the 9.1 kb band was 
referred to as the pspA-like sequence. Whether or not the pspA- 
like sequences makes a product is not know, and none has been 
identified in vitro. Since pspA-specif ic mutants can be 
difficult to produce in most strains, and exist for only a 
limited number of pneumococcal isolates, this Example identifies 
oligonucleotide probes that could distinguish between the pspA 
and pspA-like sequences. 

The purpose of this Example was to further define both 
the conserved and variable regions of pspA, and to determine 
whether the central proline-rich region is variable or conserved, 
and identify those domains of pspA that are most highly conserved 
in the pspA-like sequence (and ergo, provide oligonucleotides 
that can distinguish between the two) . Oligonucleotides were 
used and are therefore useful as both hybridization probes and as 
primers for polymerase chain reaction (PCR) analysis. 
Hybridization with oligonucleotide probes . 

The oligonucleotides used in this study were based on 
the previously determined sequence of Rxl PspA. Their position 
and orientation relative to the structural domains of Rxl PspA 
are shown in Figure 7. The reactivity of these oligonucleotide 
probes with the pspA and pspA-like sequences was examined by 
hybridization with a Hindlll digest of Rxl genomic DNA (Table 
17). As expected, each of the eight probes recognized the pspA- 
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containing 4.0 kb fragment of the Hindlll digested Rxl DNA. Five 
of the 8 probes (LSM1, 2, 3, 7, and 12) could also recognize the 
pspA-like sequence of the 9.1 kb band at least at low stringency. 
At high stringency four of the probes (LSM2, 3, 4 and 5) were 
specific for the 4.0 kb. 

These 8 probes were used to screen Hindi I I digest of 
the DAN from 18 strains of S. pneumoniae at low and high 
stringency. For comparison to earlier studies each of the 
strains was also screened using a full-length pspA probe. Table 
23 illustrates the results obtained with each strain at high 
stringency. Table 18 summarizes the reactivities of the probes 
with the strains at high and low stringency. Strain Rxl is a 
laboratory derivative of the clinical isolate/ D39. The results 
obtained with both strains were identical. They are listed under 
a single heading in Table 23 and are counted as a single strain 
in Table 28. Although AC17 and AC94 are related clinical 

isolates, they have distinguishable pspAs and are listed 

ft 

separately. All of the other strains represent independent 
isolates • 

The only strain not giving at least two pspA-homologous 
Ifindlll fragments was WU2. This observation was expected since 
WU2 was previously shown to have only one pspA-homologous 
sequence and to give only a single Hindlll fragment that 
hybridizes with Rxl pspA. Even at high stringency 6 of the 8 
probes detected more than one fragment in at least one of the 18 
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strains Tables 18 and 23. Probes LSM7, 10 and 12 reacted with 
DNA from a majority of the strains and detected two fragments in 
over 59% of the strains they reacted with. In almost every case 
the fragments detected by the oligonucleotide probes were 
identical in size to those detected by the full-length pspA 
probe. Moreover, the same pairs of fragments were frequently 
detected by probes from the 3' as well as the 5' half Rxl pspA. 
These results are consistent with earlier findings that the pairs 
of Hindlll fragments from individual isolated generally include 
two separate but homologous sequences, rather than fragments of a 
single pspA gene. 

The differences in the frequency with which the 
oligonucleotides reacted with (at least one fragment) of the 
strains in the panel was significant at P < 0.0001 by 2 x 8 chi 
square) . When the oligonucleotides were compared in terms of 
their ability to react with both fragments of each strain the P 
value was also < 0.0001. Table 18 gives the percentage of 
strains reactive with each probe, the percentage in which only 
one fragment was reactive, and the percentage in which two (or 
more) fragments were reactive. 

The last column in Table 18 give the ratio of strains 
that showed one reactive Hindlll fragment at high stringency 
divided by the total number of reactive strains. In this column 
values of 1 were obtained with probes that only reacted with one 
band in each reactive strain. Such probes are assumed to be 
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those that are most specific for pspA. The lowest values were 
obtained with probes that generally see two bands in each strain* 
Such probes are assumed to be those that represent regions 
relatively conserved between the pspA and pspA-like sequences. 
At high stringency, probes LSM3 and LSM4 detected only a single 
Hindlll fragment in the DAN of strains they reacted with. These 
findings suggested probes LSM3 and LSM4 were generally detecting 
alleles of pspA rather than the pspA-like sequence. The 
observation that the fragments detected by LSM3 or LSM4 were also 
detected by all of the other reactive probes, strengthened the 
conclusion that these probes generally detected the pspA rather 
than the pspA-like sequence. WU2 has only one pspA-homologous 
DNA sequence and secretes a serologically detectable PspA. The 
fact that LSM3 reacts with the single Hindlll fragment of WU2 is 
consistent with the interpretation that LSM3 detects the pspA 
sequences. Sequences representing the second proline region 
(LSM1) and the C-terminus (LSM2) appeared to also be relatively 
specific for the pspA sequences since they were generally 
detected in only one of the Hindlll fragments of each strain. 

Oligonucleotides, LSM12, and LSM10 detected the most 
conserved epitopes of pspA and generally reacted with both pspA- 
homologous fragments of each strain (Table 18) . LSM7 was not 
quite as broadly cross-reactive but detected two PspAs in 41% of 
strains including almost 60% of the strains it reacted with. 
Thus, sequences representing the leader, first proline region, 
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and the repeat region appear to be relatively conserved not only 
within pspA but between the pspA and pspA-like sequences. LSM3, 
4, and 5 reacted with the DNA from the smallest fraction of 
strains of any oligonucleotide (29 - 35 percent), suggesting that 
the portion of pspA encoding the a-helical region is the least 
conserved region of pspA* 

With two strains BG85C and L81905, the oligonucleotides 
detected more than two Hindlll fragments containing pspA- 
homologous sequences. Because of the small size of the 
oligonucleotide probes and the absence of Hindlll restriction 
sites within any of them, it is very unlikely that these multiple 
fragments were the results of fragmentation of the target DNA 
within the probed regions. In almost every case the extra 
oligonucleotides were detected at high stringency by more than 
one oligonucleotide. These data strongly suggest that at least 
in these two strains there are 3 or 4 sequences homologous to at 
least portions of the pspA. The probes most reactive with these 
additional sequences are those for the leader, the a-helical 
region and the proline rich region. The evidence for the 
existence of these additional pspA-related sequences was 
strengthened by results with BG58C and L81905 at low stringency 
where the LSM3 (a-helical) primer picked up the extra 1.2 kb band 
of L81905 (in addition to the 3.6 kb band) and the LSM7 (proline- 
rich) primer picked up the extra 3.2 and 1.4 kb bands (in 
addition to the 3.6 kb band) of BG58C. 
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Amplification of psoA 

The utility of these oligonucleotides as PGR primers 
was examined by determining if they could amplify fragments of 
pspA from the genomic DNA of different pneumococcal isolates. 
Applicants attempted to amplify pspAs from 14 diverse strains of 
S. pneumoniae comprising 12 different capsular types using 
primers based on the Rxl pspA seguence. Applicants observed that 
the 3' primer LSM2, which is located at the 3' end of pspA, would 
amplify an apparent pspA seguence from each of the 14 
pneumococcal strains when used in combination with LSM1 located 
in the region of pspA encoding the proline-rich region (Table 
19). LSM2 was also used in combination with four other 5' 
primers LSM1, 3, 7, 8 and 12. LSM8 is located 5' of the pspA 
start site (near the -35 region) . 

If a predominant sequence of the expected length was 
amplified that could be detected on a Southern blot with a full- 
length pspA probe, we assumed that pspA gene of the amplified DNA 
had homologous sequences similar to those of the pspA primers 
used. Based on these criteria the primer representing the a- 
helical sequence was found to be less conserved than the primers 
representing the leader, proline, and C- terminal sequences. 
These results were consistent with those observed for 
hybridization. The lowest frequency of amplification was 
observed with LSM8 which is from the Rxl sequence 5' of the pspA 
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start site. This oligonucleotide was not used in the 
hybridization studies* 

Further evidence for variability comes from differences 
in the sizes of the amplified pspA gene. The Example showed that 
when PCR primers LSM12 and LSM2 were used to amplify the entire 
coding region of PspA, PCR products from different pneumococcal 
isolates ranged in size from 1.9 and 2.3 kb (Table 20). The 
regions within pspA encoding the a-helical, proline-rich, and 
repeats were also amplified from the same isolates. As seen in 
Table 20, the variation in size of pspA appeared to come largely 
from variation in the size of pspA encoding encodes the a-helical 
region. 

Using probes that consisted of approximately the 5' and 
3' halves of pspA it has been determined that the portion of pspA 
that encodes the a-helical regions is less conserved than the 
portion of pspA that encodes the c- terminal half of the molecule. 
This Example show using 4 oligonucleotide probes from within each 
half of the DNA encoding PspA. Since a larger number of smaller 
probes were used, Applicants have been able to obtain a higher 
resolution picture of conserved and variable sequences within 
pspA and have also been able to identify regions of likely 
differences and similarities between pspA and the pspA-like 
sequences. 

The only strains in which the pspA gene has been 
identified by molecular mutations are Rxl, D39 and WU2 . Rxl and 
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D39 apparently have identical pspA molecules that are the result 
of the common laboratory origin of these two strains. WU2 lacks 
the pspA-like gene. Thus, when most pneumococci are examined by 
Southern blotting using full length-pspA as a probe, it is not 
possible to distinguish between the pspA and pspA-1 ike loco, 
since both are readily detected. A major aim of these studies 
was to attempt to identify conserved and variable regions within 
the pspA and pspA-like loci. A related aim was to determine 
whether probes based on the Rxl pspA could be identified that 
would permit one to differentiate pspA from the pspA- like 
sequence. Ideally such probes would be based on relatively 
conserved portion of the pspA sequence that was quite different 
in the pspA-like sequence. A useful pspA specific probe would be 
expected to identify the known Rxl and WU2 pspA genes and 
identify only a single Hindlll fragment in most other strains. 
Two probes (LSM3 and LSM4) never reacted with more than one pspA- 
homologous sequence in any particular strain. Both of reacted 
with Rxl pspA and LSM3 reacted with WU2 pspA. Each of these 
probes reacted with 4 of the other 15 strains. When these probes 
identified a band, however, the band was generally also detected 
by all other Rxl probes reactive with that strain's DNA. 
Additional evidence that the LSM3 and LSM4 were restricted to 
reactivity with pspA was that they reacted with the same bands in 
all three non-Rxl strains. Each probe identifies pspA in certain 
strains and even when used in combination they recognized pspA in 
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over 40 percent of strains. Probes for the second proline-rich 
region (LSM1) and the C- terminus of pspA (LSM2) generally, but 
not always, identified only one pspA-homologous sequence at high 
stringency. Collectively LSM1, 2, 3, and 4 reacted with 16 of 
the 17 isolates and in each case revealed a consensus band 
recognized by most to all of the reactive probes. 

By making the assumption that in different strains the 
Rxl pspA probes are more likely to recognize pspA than the pspA- 
like sequences, it is possible to make some predictions about 
areas of conservation and variability within the pspA and pspA- 
like sequences. When a probe detected only a single pspA- 
homologous sequence in an isolate, it was assumed that it was 
pspA. If the probe detected two pspA-homologous sequences, it 
was assumed that it was reacting with both the pspA and pspA-like 
sequence. Thus, the approximate frequency with which a probe 
detects pspA can be read from Table 18 as the percent of strains 
where it detects at least one pspA-homologous band. The 
approximate frequency with which the probes detect the pspA-like 
sequence is the percent of strains in which two or more pspA- 
homologous band are detected „ 

Using these assumptions the most variable portion of 
portion of the pspA gene was observed to be the -35 region and 
the portion encoding a-helical region. The most conserved 
portion of pspA was found to be the repeat region, the leader and 
the proline rich region. Although only one probe from the region 
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was used, the high degree of conservation among the 10 repeats in 
the Rxl sequence makes it likely that other probes for the repeat 
regions give similar results* 

The portion of the pspA-like sequence most similar to 
Rxl pspA was that encoding the leader sequence, the 5' portion of 
the proline rich region, and the repeat region, and those 
portions encoding the N-terminal end of the proline-rich and 
repeat regions. The repeat region of PspA has been shown to be 
involved in the attachment to PspA to the pneumococcal surface. 
The conservation of the repeat region among both pspA and pspA- 
like genes suggests that if is PspA-like protein is produced, 
that it may have a surface attachment mechanism similar to that 
of PspA. The need for a functional attachment site may explain 
the conservation of the repeat region. Moreover, the 
conservation in DNA encoding the repeat regions of the pspA and 
pspA-like genes suggests that the repeat regions may serve as a 
potential anti-pneumococcal drug target. The conservation in the 
leader sequence between pspA and the pspA-like sequence was also 
not surprising since similar conservation has been reported for 
the leader sequence of other gram positive proteins, such as M 
protein of group A streptococci* It is noteworthily # however, 
that there is little evidence at the DNA level that the PspA lead 
is shared by many genes other than PspA and the possible gene 
product of the pspA-like locus. 
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Although the region encoding the C-terminus of pspA 
(LSM12) or the 3' portion of the proline-rich sequence (LSM1) 
appear to be highly conserved within pspA genes, corresponding 
regions in the pspA-like sequences are either lacking, or very 
distinct from those in pspA. The reason for conservation at 
these sites is not apparent. In the case of the PspA, its C- 
terminus does not appear to be necessary for attachment, since 
mutants lacking the C-terminal 49 amino acids are apparently as 
tightly attached to the cell surface as those with the complete 
sequence. Whether these difference from pspA portends a subtle 
difference in the mechanism of attachment of proteins produced by 
these two sequences in unknown. If the C-terminal end of the 
pspA-like sequence, or the 3' portion of the proline-rich 
sequence in the pspA-like sequence are as conserved within the 
pspA-like family of genes as it is within pspA, then this region 
of pspA and the pspA- like sequence serve as targets for the 
development of probes to distinguish between all pspA and pspA- 
like genes. 

With two strains, some of the oligonucleotide probes 
identified more than two pspA-homologous sequences. In the case 
of each of these strains, there was a predominant sequence 
recognized by almost all of the probes, and two or three 
additional sequences that were each recognized by at least two of 
the probes. One interpretation of the data is that there may be 
more than two pspA-homologous genes in some strains. The 
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significance of such sequences is far from established , It is of 

interest however, that although the additional sequences is far 

from established. It is of interest however, that although the 

additional sequences share areas of homology with the leader, a- 

helical, and proline region, they exhibited no homology with the 

repeat region of the O terminus of pspA* These sequences, thus, 

might serve as elements that can recombine with pspA and/ or the 

pspA-like sequences to generate sequence diversity. 

Alternatively the sequences might produce molecules with very 

different C-terminal regions, and might not be surface attached. 

If these pspA-like sequences make products, however, they, like 

PspA, may be valuable as a component of a pneumococcal antigenic, 

immunological vaccine compositions* 

Bacterial strains, growth conditions and isolation 
of chromosomal DNA . 

S. pneumoniae strains used in this study are listed in 
Table 5. Strains were grown in 100 ml of Todd-Hewitt broth with 
0.5% yeast extract at 37 °C to an approximate density of 5xl0 8 
cells/ml* Following harvesting of the cells by centrifugation 
(2900xg, 10 minutes) , the DNA was isolated as previously 
described and stored at 4°C in TE (lOmM Tris, ImM EDTA, pH 8*0). 
Amplification of pspA sequences . 

Polymerase chain reaction (PCR) primers, which were 
also used as oligonucleotide probes in Southern hybridizations, 
were designed based on the sequence of pspA from pneumococcal 
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strain Rxl. These oligonucleotides were obtained from Oligos 
Etc. (Wilsonville, OR) and are listed in Table 22. 

PCRs were done with a MJ Research, Inc. , Programmable 
Thermal Cycler (Watertown, MA) as previously described using 
approximately 10 ng of genomic pneumococcal DAN with appropriate 
5' and 3' primer pair. The sample was brought to a total volume 
of 50 ill containing a final concentration of 50mM KC1, lOmM Tris- 
HC1 (PH 8.3), 1.5 mM MgCl 2 , 0.001% gelatin, 0.5 mM each primer, 
200mM of each deoxynucleotide triphosphate, and 2.5 U of Tag DNA 
polymerase. Following overlaying of the samples with 50 ^1 of 
mineral oil, the samples were denatured at 94 °C for 2 minutes. 
Then the samples were subjected to 10 cycles consisting of 1 
minute at 94 °C, 2 minutes at 50 °C, and 3 minutes at 72 °C followed 
by another 20 cycles of 1 minute at 94°C, 2 minutes at 50°C, and 
3 minutes at 72 °C followed by another 20 cycles of 1 minute at 
94 °C, 2 minutes at 60°C, and 3 minutes at 72 °C. After all 30 
cycles, the samples were held at 72 °C for an additional 5 minutes 
prior to cooling to 4°C. The PCR products were analyzed by 
agarose gel electrophoresis. 
DNA hybridization analysis . 

Approximately 5/xg of chromosomal DNA was digested with 
Hindlll according to the manufacturer's instructions (Promega, 
Inc., Madison, WI) . The digested DNA was electrophoresesed at 35 
mV overnight in a 0.8% agarose gels and then vacuum-blotted onto 
Nytran membranes (Schleicher & Schuell, Keene, NH) . 
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Labeling of oligonucleotide with and detection of 
probe-target hybrids were both performed with the Genius System 
according to the manufacturer's instructions (Mannheim, 
Indianapolis, IN) . All hybridizations were done for 18 hours at 
42 °C without formamide. By assuming that 1% base-pair 
mismatching results in a l°C decrease in Tm designations of 
"high" and "low" stringency were defined by salt concentration 
and temperature of post-hybridization washes. Homology between 
probe and target sequences was derived using calculated Tm the 
established method. High stringency is defined as 90% or greater 
homology, and low stringency is 80-85% sequence homology. 
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Table 1£ Hybridization of oligonucleotides withtfrodlll restriction 

fragments of Rxl DNA. 




Region 

Leader 
a-helix 
a-helix 
a-helix 
Proline 
Proline 
Repeats 
C-terminus 




Oligonucleotide 
LSM12 
LSM5 
LSM3 
LSM4 
LSM7 
LSM1 
LSM10 

Note. Values indicated are the sizes of r^rriction fragments expressed as 



4.0 
4.0, 9.1 
4.0, 9.1 

Nix 

4.0, 9.1 



IS 

4.0,9.1 
4.0 
4.0 
4.0 

4.0, 9.1 
4.0, 9.1 
4.0, 9.1 
4.0 
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Table VS. Summary or nyondization at High and Low Stringency of 
8 Oligonucleotides with Hindm Restriction Fragments of 
the 17 Pneumococcal Isolates Listed in Figure * 
Percent Percent 

with £2 bands 



Oligonucleotide 




LSM12 
LSM5 
LSM3 
LSM4 

-LSM7 
^LSMl 

LSM10 
LSM2 



with riband 

Hj 

82 
29 
35 
29 
71 
65 
94 
53 



Percent 
with 1 band 



1 band/ 
£1 band 



65 
35 
94 



1< 



88 



41 
0 
71 
53 

41 



59 
18 
0 
0 
41 
12 
59 
12 



24 
35 
24 
47 

47 



24 
12 
35 
29 
29 
53 
35 
41 



!SS m V3lUC ? ^ ^ 17 strains were examined. If no value is 
listed, then no strains were examined. 



036 
1.00 
0.25 
0.47 

0.53 



0.29 
0.40 
1.00 
1.00 
0.42 
0.82 
0.37 
0.78 
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Table fft Amplification of Pneumococcal Isolates using the Indicated 5 V Prime 



Combination with the 3' Primer LSM2 at the 3* end of pspA 


5' Primer 


Region 


Nucleotide 


Amplified/ 


Perec* 


Position 


Tested 


AmpliC 


LSM8 


- 35 


47 to 70 


2/14 


14 


LSM12 


leader 


162 to 188 


8/14 


57 


LSM3 


a-helical 


576 to 598 


3/14 


21 


LSM7 


proline 


1093 to 1117 


12/14 


86 


LSM1 


proline 


1312 to 1331 


14/14 


100 


Note, by 2x1 


i cbi square analysis the differnet primers 







different frequencies of pspAs (F < 0.0001). The tendency for there 
to be more amplification with the 3* most primers was significant at 
P <0.0001. 



WO 97/09994 



PCT/US96/14819 




pspA Region 

length 
ex-helical 
Proline 

XxCUCfiCS 

Note: 



Table 

Primer Pairs number pspAs 

examined 




LSM12+LSM2 
LSM12 + LSM6 
LSM7+LSM9 
LSM1+ LSM2 
was ai 



9 
6 
3 
19 



Size 


Range 

» 


SX>. 


1.9-23 


OA 


an 


U-1.3 


0.4 


0.17 


0.23 


0 


0 


0.6-0.65 


0.05 


0.01 



panel of 19 different pspAs. 



Data is shown only for pspAs that could be amplified with the indicated primer pairs. 
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Tabled Pneumococcal strains 

Strain Relevant characteristics 

WU2 Capsular type 3, PspA type 1 

D39 Capsular type 2, PspA type 25 

R36A Nonencapsulated mutant of D39, 

PspA type 25 

Rxl Nonencapsulated variant of R36A t 

PspA type 25 

DBL5 Capsular type 5, PspA type 33 

DBL6A Capsular type 6A, PspA type 19 

A66 ^ll Capsular type 3* t PspA type 13 

AC94 Capsular type 9L, PspA type 0 

AC17 Capsular type 9L, PspA type 0 

AC40 Capsular type 9L, PspA type 0 

AC107 Capsular type 9V, PspA type 0 

AC100 Capsular type 9V t PspA type 0 

AC140 Capsular type 9N, PspA type 18 

D109-1B Capsular type 23, PspA type 12 

BG9709 Capsular ty^c 9, PspA type 0 

BG58C Capsular type 6A, PspA type ND 

L819Q5 Capsular type 4, PspA type 25 

* 

L82233 Capsular type 14, PspA type 0 

L82006 Capsular type 1 1 PspA type 0 
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LSM1 
LSM2 
LSM3 
LSM4 
LSM5 
LSM5 
LSM7 
LSM8 
L5M9 
LSM10 

LSM11 
LSM12 
LSM13 



CCGGATCCAUCTCCTGCACCAAAAAC 

CCGGATCCTOAGCCAGAGCAGTTGGCTO 
CCGGATCCGCIXL^ACAGATTGATOAGTCrO 

^AGTCQACTGGAGTrTCTGGAGCTGGAGC 

^^TS^^^ TAmACG ^AGGAGGC 
GTTTTTGGTGCAGGAGCTGG 

GCTATGGCTACAGGTIX3 

CCACCTGTAGCCATAGC 

CCGGATCCAGCGTGCCrATCTTAGGGGCrGGTr 
GCAAGCTTATGATATAGAAATrmTA An 
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EXAMPLE 6 - Restriction Fragment Length Polymorphisms of pspA 

Reveals Grouping 

Pneumococcal surface A (PspA) is a protection eliciting 
protein of Streptococcus pneumoniae. The deduced amino acid 
sequence of PspA predicts three distinct domains; an a helical 
coiled-coil region, followed by two adjacent proline-rich 
regions, and ten 20 amino acid repeats. Almost all PspA 
molecules are cross-reactive with each other in variable degrees. 
However, using a panel of monoclonal antibodies specific for 
individual epitopes, this protein has been shown to exhibit 
considerable variability even within strains of the same capsular 
type. Oligonucleotide primers based on the sequence of pspA from 
S, pneumoniae Rxl were used to amplify the full-length pspA gene 
and the 5' portion of the gene including the a-helical and the 
proline-rich region- PCR-amplif ied product were digested with 
Hha I or Sau3A I to visualize restriction fragment length 
polymorphism of pspA. Although strains were collected from 
around the world and represented 21 different capsular types, 
isolates could be grouped into 17 families or subfamilies based 
on their RFLP pattern. The validity of this approach was 
confirmed by demonstrating that pspA of individual strains which 
are known to be clonally related were always found within a 
single pspA family. 

Numerous techniques have been employed in 
epidemiological surveillance of pneumococci which include 
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serotyping, ribotyping, pulsed field electrophoresis, multilocus 
enzyme electrophoresis, penicillin-binding protein patterns, and 
DNA fingerprinting. Previous studies have also utilized the 
variability of pneumococcal surface protein A (PspA) to 
differentiate pneumococci. This protein, which can elicit 
protective antipneumococcal antibodies, is a virulence factor 
found on all pneumococcal isolates. Although PspA molecules are 
commonly cross-reactive, they are seldom antigenically identical. 
This surface protein is the most serologically diverse protein 
know on pneumococci; therefore, it is an excellent market to be 
used to follow individual strains. Variations in PspA and the 
DNA surrounding its structural gene have proven useful for 
differentiation of 5. pneumoniae. 

When polyclonal sera are used to identify PspA, cross- 
reaction is observed between virtually all isolates. Conversely, 
when panels of monoclonal antibodies are used to compare PspA of 
independent isolation they are almost always observed to express 
different combinations of PspA epitopes. A typing system based 
on this approach has limitations because it does not easily 
account for differences in monoclonal binding strength to 
different PspA molecules. Moreover, some strains are weakly 
reactive with individual monoclonal antibodies and may not always 
give consistent results. 

A less ambiguous typing system that takes advantage of 
the diversity of PspA was therefore necessary to develop and was 
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used to examine the clonality of strains. This method involves 
examination of the DNA within and adjacent to the pspA locus* 
Southern hybridizations of pneumococcal chromosomal DNA digested 
with various endonucleases, such as Hind III, Dra I, or Kpn I, 
and probed with labeled pspA provided a means to study the 
variability of the chromosome surrounding pspA. When genomic DNA 
is probed, the pspA and the pspA-like loci are revealed. In most 
digests the pspA probe hybridizes to 2-3 fragments and, digests 
of independent isolates were generally dissimilar. 

Like the monoclonal typing system, the Southern 
hybridization procedure permitted the detection of clones of 
pneumococci. However, it did not provide a molecular approach 
for following pspA diversity. Many of the restriction sites 
defining the restriction fragment length polymorphism (RFLP) were 
outside of the pspA gene, and it was difficult to differentiate 
the pspA gene from the pspA- like locus. In an effort to develop 
a system to follow pspA diversity Applicants examined the RFLP of 
PCR-amplif ied pspA. Amplified pspA was digested with Sau3A I and 
Hha I, restriction enzymes with four base recognition sites. To 
evaluate the utility of this approach pspA from clinical and 
laboratory strains known to be clonally related as well as random 
isolates were examined. 
Bacterial strains 

Derivatives of the S pneumoniae D39-Rxl family were 
kindly provided by Rob Massure and Sanford Lacks (Figure 8) . 
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Eight clinical isolates from Spain and four isolates from 
Hungary, a gift from Alexander Tomasz. Seventy-five random 
clinical isolates from Alabams, Sweden, Alaska, and Canada were 
also studied* 
PCR amplifications 

The oligonucleotide primers used in this study are 
listed in Table 24 . Chromosomal DNA, which was isolated 
according to procedures described by Dillard et al., was used as 
template for the PGR reactions. Amplification was accomplished 
in a 50 txl reaction containing approximately 50 ng template DNA, 
.25U Tag, 50 mM of each primer, 175 j»M MgCl 2 , and 200 jiM dNTP in 
a reaction buffer containing 10 jzM Tris-HCl, pH 9.0, 50/iM KCl, 
0.1% Triton X-100, 0.01% wt/vol. gelatin. The mixture was 
overlaid with mineral oil, and placed in a DNA thermal cycler. 
The amplification program consisted on an initial denaturation 
step at 94 °C, followed by 29 cycles opf 94 °C for 1 min, 55 °C for 
2 min, and 72 °C for 3 min. The final cycle included an 
incubation at 72 °c for 5 min. 

Restriction fragment analysis of PCR-amplif ied product 

Aliquots of the PGR mixtures were digested with Hha I 
or Sau3A I in a final volume of 20/il according to manufacturer's 
protocols. After digestion the DNA fragments were 
electrophoresed on a 1.3% TBE agarose gel and stained with 
ethidium bromide. Fragment sizes were estimated by comparison to 
a lkb DNA ladder (Gibco BRL) . 
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Because of the variability of pspA, and the fact that 
the entire pspA sequence is known for only one gene, it has not 
been possible to design primers which amplify pspA from 100% of 
pneumococcal strains. However, oligonucleotide primers, LSM2 and 
LSM1, can amplify an 800 bp region of the C-terminal end in 72 of 
the 72 stains tested. Based on hybridizations at different 
stringencies, this region was found to be relatively conserved in 
pneumococcal strains, and thus would not be expected to be 
optimal for following restriction polymorphisms within the pspA 
molecule* LSM13 and LSM2 , primers which amplify the full length 
pspA gene, can amplify pspA from approximately 79% 55/75 of the 
strains tested (Table 25) ♦ 

Stability of amplified RFLP pattern within clonally related 
pneumococci 

To determine the stability of pspA during long passages 
in vitro, we examined the RFLP pattern of the pspA gene of the 
derivatives of the S. pneumoniae D39-Rxl family. Rxl is an 
acapsular derivative of 5. pneumoniae D39, the prototypical 
pneumococcal laboratory strain isolated by Avery in 1914. 
Throughout the 1900 's spontaneous and chemical mutations have 
been introduced into D39 by different laboratories (Figure 8) . 
During this period unencapsulated strains were maintained in 
vitro, and D39 was passed both in vivo and in vitro passage. All 
the derivatives of D39, including Rxl, R6, RNC, and R36A, 
produced a 1.9 kb fragment upon PCR amplification of full length 

103 



WO 97/09994 



PCTAJS96/14819 



pspA. All members of the family exhibited the RFLP pattern. 
Digestion with Sau3A I of PCR amplified full length pspA revealed 
a .83, .58, .36 and a .27 kb fragment in all of the D39-rXl 
derivatives of the family- Digesting the full length pspA with 
Hha I resulted in bands which were ,76. .47, .39, .35, and .12 kb 
(Figure 9 or Table 26) . 

The stability of pspA polymorphism was also 
investigated using pneumococcal isolates which had previously 
been shown to be clonally related by other criteria, including 
capsule type, antibiotic resistance, enzyme electromorph, and 
PspA serotype. Three sets of isolates, all of which were highly 
penicillin resistant, were collected from patients during an 
outbreak in Hungary and two separate outbreaks in Spain. PCR 
amplified full length pspA from the capsular type 19A 
pneumococcal strains from the outbreak in Hungary, DB18, DB19, 
DB20, and DB21, resulted in a band approximately 2.0 kb. After 
digesting full length pspA with Hha I, four fragments were 
visualized., 89, .48, and .28 kb. Digestion with Sau3A I yielded 
five fragments .880, .75, .35, .34, and .10kb. Capsule type 6B 
pneumococcal strains, DB1, DB2, DB3, and DB4 , were obtained from 
an outbreak in Spain. Full length pspA from these strains were 
approximately 1.9 kb. Digestion of the PCR-amplif ied fragment 
with Hhs I resulted in four fragments which were .83, .43, .33, 
and .28 kb. Sau3A I digestion yield a .88, .75, .34, and .10 kg 
fragments . DBG , DB8, and DB9, which are capsular serotype 23F 
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strains, were isolated from a second outbreak in Spain. DB6, 
DB8, and DB9 had an amplified pspA product which was 2.0 kb. Hha 
I digested fragments were .90, .52, .34, and .30 kb and Sau3A I 
fragments were .75, .52, .39, .22, .20, and .10 kb in size 
(Figure 10) . DB7 had a 19 A capsular serotype and was not 
identical to DB6, DB8, and DB9 . In the D39/Rxl family and in 
each of the three outbreak families the size of the fragments 
obtained from the Hha I and the Sau3A I digests totaled 
approximately 2.0 kb which is expected if the amplified product 
represents a single pspA sequence. 

Diversity of RFLP pattern of amplified pspA from random 
pneumococcal isolates 

PCR amplification of the pspA gene from 70 random 

clinical pneumococcal isolates yielded full-length pspA ranging 

in size from 1.8 kb to 2.3 kb. RFLP analysis of PCR-derived pspA 

revealed two to six DNA fragments ranging in size from 100 bp to 

1.9 kb depending on the strain. The calculated sum of the 

fragments never exceeded the size of the original amplified 

fragment. Not all pneumococcal strains had a unique pspA, and 

some seemingly unrelated isolates from different geographical 

regions and different capsular types exhibited similar RFLP 

patterns. Isolates were grouped into families based on the 

number of fragments produced by Hha I and Sau3A I digests and the 

relative size of these fragments. 
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Based on the RFLP patterns it was possible to identify 
17 families with four of the families containing pairs of 
subfamilies. Within families all of the restriction fragments 
were essentially the same regardless which restriction enzyme was 
used. The subfamilies represent situations where two families 
share most but not all the restriction fragments. With certain 
strains an FRLP pattern was observed where detectable fragment 
size differed from the pattern of the established family by less 
than 100 bp. Since the differences were considered small 
compared to the differences in the fragment size and the number 
of fragments between families, they were not considered in family 
designation * The RFLP pattern of two isolates from six of the 
families is pictured in Figure 11, Table 27. These families were 
completely independent of the capsular type or the protein type 

i 

as identified by monoclonal antibodies (Table 28 and 29) . 

Previous DNA hybridization studies have demonstrated 
that the pspA gene of different isolates are the most conserved 
in their 3' region of the gene and more variable in the 5' region 
of the gene. Thus, if seemed likely that the differences in the 
pspA families reflected primarily differences in the 5' end of 
the gene. To confirm this theory, the a helical and proline 
region of pspA was examined without the amino acid repeats. 
Nucleotide primers LSM13 and KSH2 were used to amplify this 
fragment which is approximately 1.6 kb. Examination of this 
region of pspA afforded two things. 
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This primer pair permitted amplification of 90% of the 
strains which is greater than the 75% of the strains which can be 
amplified with oligonucleotides which amplify the full length 
gene. Second, it allowed Applicants to examine if the original 
groupings which were based on the full length gene coincide with 
the fingerprint patterns obtained by looking at the 5' half of 
the gene. 

Figure 12 contains the same strains which were examined 
in Figure 11 but the PCR products were amplified with SKH2 and 
LSM13. The RFLP patterns obtained from digestion of the 
amplified a helical and proline rich region confirms the original 
designated families- However, these primers amplify a smaller 
portion of the psaA and therefore the difference is the families 
is not as dramatic as the RFLP patterns obtained from the RFLP 
pattern of the full length gene. 

The polymerase chain reaction has simplified the 
process of analyzing pspA gene and have provided a means of using 
pspA diversity to examine the epidemiology of 5. pneumoniae* 
Because not all strains contained a unigue fingerprint of pspA, 
RFLP patterns of pspA cannot be used alone to identify the 
clonality of a strain. These results indicate the RFLP of PCR- 
amplified pspA from pneumococcal strains in conjunction with 
other technigues may be useful for identifying the clonal 
relatedness among pneumococcal isolates, and that this pattern is 
stable over long passages in vitro, 
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These findings suggests that the population of pspA is 
not as diverse as originally believed* PCR-RFLP of pspA may 
perhaps represent a relatively simplistic technique to quickly 
access the variability of the gene within a population. Further, 
these findings enable techniques to diagnose. S. pneumoniae via 
PCR or hybridization by primers on probes to regions of pspA 
common within groupings. 
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The sequence studies divide the known strains into several families based on 
sequence homologies. Sequence data demonstrates that there have been extensive 
recombinations occurring in nature within pspA genes. The net effect of the 
recombination is that the "families" identified by specific sequences differ dependinj 
upon which part of the pspA molecule Is used for analysis. "Families" or "grouping 
identified by the 5* half of the alpha-helical region, the 3* half of the a-helical regia 
and the proline rich region are each distinct and differ slightly from each other* In 
addition there is considerable evidence of other diversity (including base substitute 
and deletions and insertions in the sequences) among otherwise closely related 
molecules. 

This result indicates that it is expected that there will be a continuum of overlappin 
sequences of PspAs, rather than a discrete set of sequences. 

The findings indicate that there is the greatest conservation of sequence in the 3* ha 
of the a-helical region and in the immediate 5* tip. Because the diversity in the mid 
half of the a-helical region is greater, this region is of little use in predicting cross- 
reactivity among vaccine components and challenge strains. Thus, the sequence of 
3* half of the alpha-helical region and the 5 1 tip of the coding sequence are likely to 
the critical sequences for predicting PspA cross-reactions and vaccine composition. 

The sequence of the proline-rich region may not be particularly important to * 
composition of a vaccine because this region has not been shown to be able to elicit 

cross-protection even though it is highly conserved. The reason for this is presuma4/y 
because antibodies to epitopes in this region are not surface exposed. 

Based on our present sequences of 27 diverse pspAs we have found that then 
are 4 families of the 3' half of the a-helical region and 2-3 families of the very ? tip 
the a-helical region. Together these form 6 combinations of the 3' and 5' families. 
This approach therefore should permit us to identify a panel of pspAs with 3' and 5 
K*»1i«t s<mu«i«!s representative of the ereatest number of different rs»As. Jfce TO I 
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Table^Rdationship of Capsular type and RFLP family. 




pspA 
Frai l 

A 

B 

C 

D 
DD 

E 

F 
FF 

0 
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I 
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K 
KK 

L 
M 
MM 



1 2 3 4 5 6 6A 6B 7 8 9A 9L 9N 9V 10 11 12 13 14 15 19 22 23 31 33 35 



1 



1 
1 
1 



1 



1 
1 



2 1 



1 
1 



2 4 
1 1 



1 I 
I 



1 2 



1 



1 

1 1 



1 



1 



1 



1 



1 



ND 



1 



1 



1 
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Table 3& Oligonucleotide ueed In this study. 



Designation 


Saqusnee 5"-3' 


Nuelsotlds 
position 


L8M2 


QCQ CGT CQA CGG CTT 
AAA CCC ATT CAC CAT 
TOG 


• 

1990 to 1967 


L8M1 


CCQ QAT CCA GCT CCT 
OCA CCA AAA AC 


1312 to 1331 


LSM13 
SKH2 


OCA AGC TTATQATAT 
AflAAATTTGTAAC 

CCA CAT ACC QTTTTC 
TTG TTT CCA QCC 


1to26 
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Table 2& Amplification of pspA from a panel of 72 independent isolates 4 of S. 
pneumoniae. 



CAPSULE 
TYPE 


NUMBER OF 
STRAINS 
EXAMINED 


LSM13 AND 
LSM2 


LSM13 AND 
SKH2 






% OF STRAINS 
AMPURED 


% OF STRAINS 
AMPURED 


1 


o 


100 


100 


2 


4 
1 


100 


100 


3 


a 
O 


50 


87 


A 


6 


67 


100 


5 


1 


100 


100 


6 


7 


oft 

29 


86 


6A 




lUO 


100 


CD 

QO 


0 


100 


100 


7 


2 


50 


100 


8 


1 


100 


100 


9V 


3 


100 


100 


9A 


2 


1 WW 


ion 


9L 


1 


100 


100 


9N 


3 


100 


100 


10 


1 


100 


100 


11 


2 


50 


100 


12 


2 


0 


100 


13 


1 


100 


100 


14 


4 


0 


75 


15 


2 


50 


50 


19 


5 


100 


100 


22 


3 


33 


100 


23 


1 


100 


100 


33 


1 


0 


100 


35 


1 


0 


100 


nd 


3 


100 


100 



'Our strain collection contains several groups of isolates known to be previously to be clonal and 
collected for that purpose. The data reported in the table includes only one representative isolate 
from such clonal groups. 
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Table Rx1-D39 derivatives 





1 CXT I? "dlL T t~LT.ni j 

1 oLck Or aha I DIGESTS 


SEEOFSa«3AI 




(Kb) 


DIGESTS (Kb) 


D39 


.76, .47, .39, .35, .12 


.83, .53, .36, 27 


Rxl 


.76, .47, .39, .35, .12 


.83, .58, .36, 21 


R800 


76, .47, .39, .35, .12 


.83, .58, .36. 27 


R6 


76, .47, .39, .35, .12 


* 

.83, .58, .36, 27 


R61 


I 76, .47, .39, .35, .12 


.83, .58, .36, 27 


R6X 


.76, .47, .39, .35, .12 


.83, .58, .36, 27 


R36NC 


.76, .47, .39, .35, .12 


• 

■r 

.83, .58, .36. .27 


R36A 


I .76, .47, .39, .35. .12 


.83. .58. .36. .27 
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TABLE ^Strain information and family designation of independent isolates. 



STRAIN 


CAPSULE 


PsdA 


PA Mil V 


SIZE OF Hha 


■ 

1 SIZE OF Sau3A 1 1 




TYPE 


TYPE 




FRAGMENTS 


FRAGMENTS 


BG9163 


6B 


21 


r\ 
V/ 


1.55, .35 


1.05, .35. 52 I 


EF6796 


6A 


1 


C 


1.5, ,35 


1.05, .35,52 I 


EF5668 


4 




DD 


1.25, .49. .32 


1.0, .80, .35 I 


EF8616A 


4 


ND 


DD 


1.25, .49, .32 


1.0, .80, .35 I 


EF3296 


4 


20 


E 


1.0, .40, .33 


1.15, .50, .34 I 


EF4135 


4 


ND 


E 


1.0, .40, .33 


1.15, .50, .34 I 


BG7619 


10 


ND 


F 


1.3. .40, 59. .10 


1 

.82, .76, .35 1 


BG7941 


11 


ND 


F 


1.3, .40, .29, .10 


.82, .76, .35 1 


BG7813 


14 


8 


H 


1.05, .70, .36 


.90. .77, .35 1 


BG7736 


8 


ND 


H 


1.05, .70. .36 


.90, .77, .35 1 


AC113 


9A 


ND 


1 


1.4, .34.58 


15, .80 1 


AC99 


9V 


5 


1 


1.4. .34.58 


1.2. .80 I 
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TableV.Relationship of RFLP family and PspA type. 
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EXAHPLE 7 - Ability of PspA immunogens to protect against 

individual challenge strains 

CBA/N or BALB cJ mice were given 1 injection of 0.5 - 
/xg PspA in CFA, followed 2 weeks later by a boost in saline, and 
challenged between 7 and 14 (average 10) days post boost. 
Control mice were administered a similar immunization regimen, 
except that the immunization came from an isogeneic strain unable 
to make PspA, The PspA was either full length, isolated from 
pneumococ6i or cloned full length or BC100 PspA, as little 
statistical significance has been seen in immunogenicity between 
full length PspA and BC100, The challenge doses ranged from 
about 10 3 to 10 4 pneunocci in inoculum, but in all cases the 
challenge was at least 100 times LD 50 . 

The results are shown in the following Tables 30 to 60, 
and the conclusions set forth therein. 

From the data, it appears that an antigenic, 
immunological or vaccine composition can contain any two to 
seven, preferably three to five PspA, e.g., PspAs from R36A and 
BG9739, alone, or combined with any or all of PspAs from Wu2, 
Ef5668, and DB15. Note that surprisingly WU2 PspA provided 
better protection against D39 that did R36a/Rxl/D39, and that 
also surprisingly PspA from Wu2 protected better against BG9739 
than did PspA from BG9739. Combinations containing R36A, BG9739 
and WU2 PspAs were most widely protective; and therefore, a 
preferred composition can contain any three PspA, preferably 
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R36A, BG9739 and WU2 . The data in this Example shows that PspA 
from varying strains is protective, and that it is possible to 
formulate protective compositions using any PspA or any 
combination of the PspAs from the eight different PspAs employed 
in the tests. Similarly, one can select PspaS on the basis of 
the groupings in the previous Example. Note additionally that 
each of PspA from R36A, BG9739, EF5668 and DBL5 are, from the 
data, good for use in compositions. 




Applicants have chosen to express data as median (a non- 
parametric parameter) rather than averages because the times to 
death do not follow a normal distribution. in fact there are 
generally two peaks. One is around day 3 or 6 when most of the 
mice die and the other is at > 21 for mice that live. Thus, it 
becomes nonsensical to average values like 21 or 22 with values 
like 3 or 6. One mouse that lives out of 5 has a tremendous 
effect on such an average but very little effect on the median. 
Thus, the median becomes the most robust estimator of time to 
death of most of the mice. 
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Relative ability of different PspAs to Protect against each challenge strains of 5. 

pneumoniae 

(Summary of statistically significant protection) 



Vaccine Pap A 





R36A, 


JD908/ 
WU2 


JS1020/ 
BG9739 


EF3296|EF5668 


L819G5 


J55010J 
DBL5 


JS3020 
DBUA 


All 
immune 


best 
protect 


Challenge 


Caps 


FspA 


hspA 


Rxl.D39 






Strain 


type 


type 




K 


a 


b 


E 


DD 


b 


n 


D 






D39 


2 


25 


K 


++ 


+++ 






♦ 








+-f 


+++ 


WU2 


3 


1 


a 


+++ 


+++ 


+++ 




+++ 


+++ 


+++ 


+++ 


+++ 


+++ 


A66 


3 


13 


a 


+++ 


+++ 


+++ 




+++ 


+++ 


++■*- 


+± 


+++ 


«■++ 


EF10197 


3 


18 


M 


+++ 




+++ 












+++ 


+++ 


ATCC6303 


3 


7 


m 


+++ 
















+++ 


+++ 


BG9739 


4 


26 


b 


+ 


+++ 


+ 


0^ 


0 


+± 


0 


0 




+++ 


EF3296 


4 


20 


E 


+± 


+± 


0+ 








0 


0 


0 


+± 


EF5668 


4 


12 


DD 




0 


+++ 


0- 


+++ 


0+ 


♦ 


0* 


++ 


+++ 


L81905 


4 


23 


b 




+ 


++ 


++ 


0 


+ 




+± 




+ + 


DBL5 


5 


33 


n 


+ 




+ 




+ 


+ 


++ 


0 


++ 


++ 


EF6796 


6A 


1 


c 


+++ 
















+++ 


++♦ 


D5L6A 


6A 


19 


D 


+++ 


+± 


++ 


+± 


+++ 




+± 




++ 


+++ 


BG9163 


6B 


21 


C 


+++ 




+++ 












+++ 


+++ 


BG7322 


6B 


24 


C 


+++ 


+++ 


+± 


0 


+++ 


+± 


+++ 


+± 


+++ 


+++ 



Note: Empty cells indicate that no experiment has been done. Bold means significant at P < 0.05, 
Small font bold (♦) means 0.02 £P < 0.05. Large font bold means P < 0.02. For this table statistical 
significance refers to delay in time to death except as indicate in the (+) footnote below. When "all 
immune" showed significant protection against death but individual data cells did not the result for 
"all immune** is presented under best protection on the assumption that if more mice were done in 
each data cell one or more of them would have exhibited significant protection against death. 

+++ » statistically significant protection against death; £ 50% protection from death 
++ = statistically significant protection against death; < 50% protection from death 
+± = statistically significant delay in death; £20 protection from death 

+ = statistically significant delay in death; <20 protection from death, (or significant protection 

against death but not a significant delay in death) 
0++ Hoi statistically delay in time to death; but £ 50% protection from death 
0+ = Hot statistically delay in time to death; but >IS day extension in median time to death or £20% 

protection from death. 
0 = No apparent extension in time to death or protection from death. 
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Relative ability of different FspAs to Protect against each challenge stxains of S. 

pneumoniae 

(Expressed as Median days Alive post challenge) 









Vaccine FspA 


JU6A. 

OtUXft 


JO908 
WU2 


JS102D/ 
BC9739 


EF3296 


EF5668 


181905 


JS5Q10J 
DBU 


JS3020 
DBUA 


All 


All 


Challenge 


Caps 


PspA^spA 


Strain 




type 




K 


a 


b 


E | 


DO 


b 


a 


D 






D39 


2 


25 


K 


43 


>21 






4 








5 


2 


WU2 


3 


1 


a 


>21 


>21 


>21 




>21 


>21 


>21 


>21 


>21 


2 


A66 


3 


13 


a 


>21 


>21 


>21 




>21 


>21 


>21 


4 


>21 


2 


EF10197 


i 


m w 


tct 


>21 




>21 












>21 


2 


ATCCU03 


3 


7 


a 


>21 
















>21 


5 


BG9739 


4 


is 




3 


>21 


6 


3 


3 


5,13 


2 


2 


3 


2 


&3296 


4 


2d 




5 


5 


43 








1 


2 


3 


2 


EF5668 


4 


12 


w 


6 


2 


>21 


13 


>21 


4- 


>ai 


3 


8 


3 


L8190S 


4 


"2i 




5 


5 


a 


6 


3 


5 


3 


3J 


5 


2 


bBLS 




33 


TT 


4 




3 




3 


33 




2 


33 


2 


~EFS796 


6A 


1 


-*r 


>21 
















>21 


"f 


DBUA 


6A 


19 


u 


>21 


83 


13 


9 


>21 


• 


12 


>21 


123 


53 


BG9163 


6b 


TT 


c 


>21 




>21 












>21 


4.4 


bG73Z2 




24 


£ 


>2X 


>2l 


143 


6 


>21 


123 


>21 


11 


>21 


7 



Note: Bold denotes statistically significant esctenskm of life at P< O05. Saull font dcsotet simp coos? Urge font 
denote* F < OJE. Median tunes to death indicated as 8>21, are situations where the medium as not within a 
continuum of values. In those cases the numbers shown are those dosest to the median. In these cases the values 
give are those closest to the calculated median. Fractional values such as 33, indicate that the rriedlan is halfway 
between two numbers, in this case 3 and 4. As indicated in the original data (S1Q3B), some experiments were 
terminated prior to 21 days post infection. There is little reason to assume, however, that results would have been 
sign i fic a n tly effected by the early termination's since very few mice infected with the strains used in those studies, 
have ever been observed to die later than 10 or 15 days post challenge. For statistical purposes all mice alive at the 
end of experi m ents were assumed to have been completely protected, and for the sake of calculations aU surviving 
mice were assigned values of >21. 
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Ability of different FspAs to Protect Against Each Challenge strain of 

S. pneumoniae 

(Expressed as increase in survival time in days) 

(A denotes Z 50% immune mice alive) 



Vaccine PapA 





R36A. 
Rxl,D39 


JD908 
WU2 


IS1Q30/ 
BG9739 


EF32S6 

4*4 


EF5668 




DBL5 


DDL6A 


All 


PCX 


Challenge 


Caps 


VspfyspA 


Strain 


type 


type 




K 


a 


b 


E 


CD 


b 


n 


D 






D99 


2 


25 


K 


US 


A 






2 








3 


A 


WU2 


3 


I 


a 


A 


A 


A 




A 


A 


A 


A 


A 


A 


A66 


3 




a 


A 


A 


A 




A 


A 


A 


2 


A 


A 


EF10197 


3 


is 


M 


A 




A 












A 


A 


ATCGfiXD 


3 




a 


A 
















A 


A 


BG9739 


4 


26 


b 


1 


A 


4 


1 


l 


3,11 


0 


0 


1 


"A" 


EF3296 


4 


20 


E 


3 


3 


IS. 








0 


0 


l 


3 


EF5668 


4 


12 


Db" 


3 


-l 


A 


10 


A 


l 


A 


2 


5 


A 


L61905 


4 


23 


T" 


3 


3 


6 


4 


1 


3 


1 


L5 


3 


6 


DBL5 




33 


n 


2 




1 




1 


1.5 


4 


0 


1.5 


4 


EF6796 


6A 


1 


c 


A 
















A 


A 


DBL6A 


6A 


19 


D 


A 


3 


7.5 


3.5 


A 


IS 


63 


A 


7 


A 


BG9163 


6B 


21 


C 


A 




A 












A 


A 


BG7322 


6B 


24 




A 


A 


7.5 


-I 


A 


S3 


A 


4 


A 


A 






K36A 


WU2 


BC9739 


EF3296 


EF566& 


L81905 


DBL3 


DBL6A 


All 


Best 



Note Bold denotes statistically significant extension of life at P < 0.05. SmiU font dtaotcs oozsT <&os ; large font 
denotes P < 0.02. Median increases in survival listed as 3, 9 or 1 A derote groups where the median does not fall 
within a continuum of values. In these cases the values give axe those closest to the calculated median, 
Fractional values such as 3.5, indicate that the median is halfway between two numbers, in this case 3 and 4. 




Relative ability of different PspAs to Protect against each challenge strains of 

S. pneumoniae 
(expressed % alive at 21 days post challenge) 





Vaccine Pap A 


R36A, 
Rxl,D39 


JD90S 
WU2 


JS1020/ 
BC9739 


EF3296 


EF5668 


L81905 


JS50103 
DBL5 


JS3020 
DBL6A 


All 
snmune 


All 
control 


Challenge 


Caps 


PspA 


pspA 


Strain 


-SE. 


type 


family 


K 


a 


b 


E 


DO 


b 


11 


D 






D39 


2 


25 


K 


38 


60 






30 








38 


3 


WU2 


3 


1 


a 


100 


100 


100 




100 


100 


100 


100 


100 


"ii" 


A66 


3 


13 


a 


75 


100 


80 




75 


100 


60 


20 


76 • 


5 


EF10197 


3 


18 


M 


100 




ao 












90 


0 


ATCC&303 


3 


7 


a 


100 
















100 


0 


BG9739 


4 


26 


b 


"TT" 


60 


13 




— 3— 


25 


"TT 


B 


12 


0 


EF3296 


4 


20 


E 




20 


10 . 










6 


8 


0 


EF5668 


4 


12 


CO 


22 


25 


60 


40 


100 


40 


to 


0 


41 


9 


L8190S 


4 


23 


T> 


10 


0 


31 


40 


0 


0 


14 


0 


14 


0 


DBL5 


5 


33 


n 


10 




14 




0 


0 


29 


0 


4 


0 


EF6796 


6A 


1 


-fi 


100 
















100 


0 


DBL6A 


6A 


19 


u 


67 


25 


13 


0 


60 


25 


0 


80 


35 


4 


BG9163 


6B 


21 


c 


89 








80 








86 


20 


BG7322 


6B 


24 


c 


100 


60 


25 


0 


89 


25 


80 


25 


55 


6 



Bold, denotes statistically significant protection against death at P < 045. Bold small font indicates significant 
protection against death at 0D2 £ P < 0.05. Bold large font indicates significant protection against death at P < 042. 
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Relative ability of difft*-nt PspAs to Protect against each chalice strain of 

S. pneumoniae 



(% ciatected from death at 21 days post challenge) 

Vaccine FspA 





R36A, 

Rxl,D39 


WU2 
JD908 


BC9739 
J51Q20 


EF3296 


EF5668 


L819G5 


DBL5 
JS5010-3 


D8L6A 


All 


Best 


Challenge 


fans 


PspA^spA 


Strain 




2EL 


famu) 


K 


■ 


b 


E 


ED 


K 

V 


n 

U. 


n 






039 


2 


25 


K 


36 


59 






21 








36 


59 


"WU2 


3 


1 


a 


100 


100 


100 




100 


100 


100 


100 


100 


100 


A66 


3 


ii 


a 


71 


100 


79 




74 


100 


58 


16 


75 


100 


EFlflitf 


3 


it 


M 


100 




ao 












90 


100 


ATCCUK 


3 


i 


a 


100 
















100 


100 


BG9739 


4 


26 


b 


11 


60 


13 


23 


0 


25 


0 


0 


12 


60 


EF3294 


4 


20 


E 


23 


20 


10 








0 


0 


a 


■ 2T 


E&68 


4 


12 


LJD 


U 


IS 


Si 


34 


100 


34 


5* 


-10 


35 


100 


L81905 


4 


if" 


b 


10 


0 


31 


40 


0 


0 


14 


0 


14 


40 






"5T 


n 


10 




14 




0 


0 


29 


0 


4 


29 


U ll/arw 


6A 


1 


c 


100 
















100 


100 


DBL6A 


6A 


19 


D 


66 


22 


30 


-4 


58 


22 


-4 


79 


33 


79 


BG9163 


6B 


21 


C 


86 




75 












83 


86 


8G7322 


6B 


24 


c 


100 


57 


22 


0 


88 


22 


79 


22 


52 


100 



Bold, denotes artistically significant protection against death at P < 0.05. Bold small font indicates significant 
protection against death at 0D2iP< 0.05. Bold Urge font indicates significant protection against death at P<0JQ2. 
% protected has been corrected for any survivors in the control mice 

% protected » 100 x (% alive in immune - % alive in control)/(100 -% alive in control). Thus, if there were any 
mice alive in the control animals, the calculated "% protected" is less than the observed "% alive 1 * listed in the 
previous table. The only exceptions to this are if 100% of Immunized mice lived. Negative numbers mean that 
less immunized mice lived than did control mice. Please note that none of these negative numbers are significant 
even though we are using a one tailed test 
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Tarn 55 

Recommended Immunogens to Protection against the indicated challenge 
strains of S» pneumoniae Based on Protection Score 
Based on median days alive and percent protected 
(numbers refer to preference as a vaccine strain with respect to the indicated 

challenge strain, 
Is best) 





Vaccine FapA 


R*LD39 


WU2 
JD908 


BC9739 


EF329* 


EFSftt 


U190S 


DBL5 
JS50UL3 


DSUA 


Challenge 


Caps 


PspA^SpA 




Strain 




type 


(unify 


TT 


a 


b 


E 


CD 


b 


n 


D 


D39 


2 


25 


K 


2 


1 






3 








WU2 


3 


1 


a 


1 


1 


1 




1 


1 


1 


1 


I 


A66 


3 


\i 


a 


2 


1 


2 




2 


1 


3 


— ar 




3 


18 


M 


1 




2 












ATCC63K 


3 


7 


a 


1 
















BG9739 


4 


26 


T 


3 


1 


2 


3 


3 


2 


0 


0 








4 


20 


E 


1 


1 


2 








0 


0 


EF5668 


4 


12 


DO 


0 


0 


2 


3 


1 


6 


2 


0 


LB1905 


4 


23 


b 


2 


— 5— 


1 


1 


0 


0 


0 


5™ 


DBL5 


5 


33 


n 


2 




3 




0 


3 


1 


0 


£ 






6A 


1 


£ 


1 
















DBL6A 


6A 


19 


D 


2 


0 


3 


0 


2 


0 


0 


1 


BG9163 


6B 


21 




1 




1 












BG7322 


6B 


24 


c 


1 


2 


3 




1 


3 


1 


3 


Number of #l*s 


7 


5 


3 


1 


3 


2 


3 


2 



Bold, denotes statistically significant protection against death at P < 0X15. Where more than one FspA were 

equally protective, the same values were given to each. Recommendations are based on days to 
death with % protection dividing ties, especially among those where greater than 50% of mice lived 
to 21 days. "0" indicates test were conducted but compared to the other PspAs this one is not 
recommended. 

Conclusions: 

Statistically significant protection against death with >50% protection; 11/14 of the strains = 79% 
Statistically significant protection against death; 13/14 strains = 93% 
Statistically significant extension of life in 14/14 or 100% of strains. 
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Criterion 
2#1 PspAfor 
each challenge 
strain 



>onents as o f 95/8/27 
Vaccine Component 
(cumulative strains protected) 
% maximally protected 



2#2 Psp A for 
each challenge' 

strain 
Max score 
(+)type 
score 
Max 
Increase in 
Days alive 



1 

R36A 

(7) 
50% 

R36A 

(12) 
86% 

R36A 

(9) 
64% 

R36A 

W 
64% 



2_ 

WU2 

(10) 
71% 

BG9739 

(12) 
100% 
WU2 

(U) 
79% 

wur 
(11) 

79% 



BG9739 

(11) 
79% 



4 | 


5 


EF5668 


D8L5 


(12) 


(13) 


86% 


93% 



6_ 
DDL6A 

100% 



% protected 



hcoretica] 
mixture based 
on a lew 
testable 
assumptions 
(see below) 



R36A 

(7) 
50% 
R36A 

(10) 
64% 



WU2 

(10) 
64% 

BG9739 

(12) 
86% 



BG9739 


DBL5 


(13) 


(U) 


92% 


100% 


BG9739 


DBL5 


(13) 


(M) 


92% 


100% 


DBL5 


EF5668 


(ID 


(12) 


79% 


86% 


DBL5 


EF3296 


(13) 


(14) 


92% 


100% 



DBL6A 
(13) 
92% 



EE3296 

(14) 
100% 



* This is not a unique combination. See table below. 

Combinations where all Challenge Strains have a Vaccine strain 

with a score of £#2 

Number* Combin-tionl 
PspAs in "1 strains 

Combination 



R36A + WU2 + 
EF5668 



R36A + WU2 ♦ 
DDLS 




11 



15 



22 



WO 97/09994 



PCT/US96/14819 



Pooled Data for Protection against D39 by various PspAs; 
Days alive for each mouse 



Exp. 


Log 

CFU 
D39 


Mice 


Days to Death/ immunogen 


Rxl/R36A 
D39 


JD908 
(WU2) 


EF566S 


rUI Immune 


control 


143 


4.5 


CBA/N 






MJU.2 




1,102J 


E145 


4.0 


CBA/N 


2.333.4 








1.1.2.3.4 


BBS 
BCC 


5.93 


BALB/c 


3,3x>21 










£143 


3.0 


CBA/N 






2.6J>x>10 




33.3.5.5 


E140 
BC100 


ZM 


CBA/N 


4X5,7,15 










E169 


2.7 


CBA/N 


2.4x>2t 


2A3x >21 






1Z12J 


E154 


2.6 


CBA/N 


!.Z3,2x >21 








4x2,5,>21 


All 
S30 






2JJ3A4.4 
5.7,15 




1.UZ2 




4x1,6x2. 
3J.4 


All 






4x2,5x3, 
3x4,5,7, 
1S.9*>21 


2A3x>2l 


U.3,2.2,2.6 
3x>21 


UJx* 
5x3,3x4, 
5,5.6,7,15. 

I5x>21 


5x1,16x2, 
6x3,4,4, 
5^>21 



Pooled Data for Protection against D39 by various PspAs 

Median Days Alive & alive : dead 

with corresponding P values. 



Exp. 


Log 

CFU 


Mice 


Rxl/R36A 
DOT 


JD9G8 
(WU2) 


EF5666 


AUImmufW 


ConliQl 




D39 




med a:d 


mtd aid 


med a:d 


med »:d 


med| »:d 


143 


43 


COA/N 






2 0:5 




E145 


4.0 


CBA/N 


3 0:5 

ILS. 








2 0:5 


£028 BCC 


5.93 


OALB/c 


>21 3:1 
429 








2 0:4 


£143 


34 


COA/N 






>21 3:2 
tv*. rus. 




E140 BC100 


X81 


CBA/N 


5 0:5 
0.018 








2 0:3 


E169 


2.7 


CBA/N 


>21 4:1 
4116 -024 


>21 3:2 
416 n-*. 






2 0:5 


E154 


2.6 


CBA/N 


3 2:3 

tV». M. 








2 1:5 


All S3.0 






4 0:10 
4008 




2 0:5 


1 2 0:13 


AH 






43 9:15 
.0057 401 


>21 3:2 
406 4045 
+++ 


4 O*) 3:7 
ILS. 434 
♦ 


5 15:24 
400) • oow 


2 1:32 


X Aliv* 






38 


€0 


30 


38 


3 








36 


59 


. 28 


36 1 








SUVD39 


WU2 


EF5668 


All Immune | control* 



WO 97/09994 



PCT/US96/14819 



-fflftgfo 

Pooled Data for Protection against WU2. by various PspAs 



Exp. 


CFU 

WU2 


Mice 








Days to Death/ tmmunogen 


FL-R36A 


Rxl 
BC100 


joioa 

(WU2) 


JS1020 


BC9739 

DCJUU 


EF5668 


U1905 
bclOO 


DBL5 

i j|M 

bclOO 


JS3020 
(DDL6A) 


control 


Dr. Ed. expL 






















tots of prior expts. 


* . . 






















'3.0 


CBA/N 


15x>21 


















1,1,11x2, 
7*3,4 


E02S 


6.01 


BALBA 


4x>21 
003 /a.*. 


















<6,6,>21 


£084 


X75 l 


CBA/N 








3x>15 












>13 


£125 

bctOO 


157 


CBA/N 










4x>21 




4x>21 


4x>21 




2A33A 
>2l 


E129 


3.18 


CBA/N 








Sx>23 












2,2.2,2,3 


E140 

BC100 


3.43 


CBA/N 




4x>21 
















1*3*2.3,4 


£143 


3.0 


CBA/N 












8x>10 








UA2A3 


E144 


3.9 


CBA/N 


















Sx>21 


5x2 


£172 


3.98 


CBA/N 






5x>21 














5x3 ■ 


All 






Mx>21 


4x>21 


5x>21 


8x>21 


4x>21 


8x>21 


4x>21 


U>2\ 


Sx>21 


6x1.33x2, 
20x3, 

>21 


All Immune 


61x>21 



Pooled Data for Protection against WU2 by various PspAs 



J 



Exp. 


CFU 

WU2 


Mice 


Median days Alive 
Alive: Dead 
P value based on Alive : Dead 

P value calculated compared to pooled controls (in this cue 65 control mice) 

Score 


FL-R36A 


Rxt 
BO00 


JD108 

(WU2) 


JS1020 
(B09739) 


BC9739 
bclOO 


EF5668 


L819Q5 
bclOO 


PBL5 
bclOO 


JS3020 
(DBL6A) 


control 


Dr. Ed, expt 


















+++ 




lots of prior expo. 


+++ 




















£012 


-3.0 


CBA/N 


>21 
15:0 


















U.SU2, 
7X3,4 


£028 




8 ALBA 


4x>21 


















4,6.6.>2l 


£084 


3JS* 


CBA/N 








3x>15 












UA2A3L 
MS 


£125 
btm 


Ts* 


CBA/N 










4x>21 




4x>21 


4x>21 




2A34A 
>21 


£129 


3.18 


CBA/N 








5x>23 














E140 

BC100 


3.43 


CBA/N 




4x>21 
















U5xX3w4 


E143 


3-0 


CBA/N 












8x>10 










£144 


3.9 


CBA/N 


















3x>21 


5x2 


E172 


3.98 


CBA/N 






5x>21 














5x3 


All 






>21 
19:0 

<xxm 


>21 
4:0 

-cOOOl 

4-H- 


>21 
5:0 
<A001 
+++ 


>21 
8:0 
<.0001 


>21 
4:0 
<.0001 


>21 
8:0 
<JX»1 


>21 
4:0 
<.0001 


>21 
.4:0 
<.0001 


>21 
4:0 
<.0001 
+♦+ 


2 
1:64 


X alive 


100 


100 


100 


100 


100 


100 


100 


100 


100 


2 


























FL-R36A 


Rxl 
BCJ00 


JD10B 
(WU2) 


JS1Q20 
(BC9739) 


BG9739 
bclOO 


EF5668 


181905 
bclOO 


DBLS 
bclOO 


JS3020 
(DBL6A) 


control 



WO 97/09994 



PCT/US96714819 



V7U2 
CfcalUazt 


days of death 


mediin days 
of death 


a_.a : 
dead 


P value 
based on days 
lo death 


P value 
based on 
alive :dead 


Score 


alive 


% 
prot. 


—711 

immune 


61x>21 


>21 


61:0 


<.O001 


<,000l 


+++ 


100 


100 


All 
controls 


6x1, 33x2, 20x 
3, 4,4.4.6,6. >21 


2 


iM 








1 


1 



Pooled Data for Protection against A66. by various FspAs 



Exp. 


CFU 
A66 




Days to Death/ ixnmunogen 


FL-R36AA 
D39 


Rxl 
BO00 


JO90B 
(WU2) 


JS1020 
(BG9739) 


BG9739 
bclOO 


EF5668 


L81905 
Ft 


U1905 
bclOO 


JS5010JFL 
(DBL5) 


DBL5 
bclOO 


JS3020 
(DDL6A) 


control - 


£169 


260 


C8A/K 


5x>21 




Sx >21 


















U.2.2.6 


£152 
fecKO 


278 












4*>21 






4x>2l 




4x>21 




3x2.3 6 6 
>21 


E104 


3.0 


C9A/K 








2 MM >22 










3,4,4.2* >22 




2.4.4J.>22 




E143 


3.0 














4,4x 

>10 


















4x >21 




















1.1.1 


E172 


3.94 


C0A/N 














5x >21 












£145 


3.97 


CflA/h 


13,4x>21 






















1.Z2.2.4 


£121 


4.16 




3x3,2x4, 
5*>21 






















l,8x2,>21 


All 






3x3,2x4. 
I3,14x>21 


4x>21 


5x>21 


2^.3x >21 


4x>21 


4.4x 

>21 


5x>21 


4x>21 


3,4,40* >21 


4x>21 


2,4,4^21 


7x1,22x2, 
3x3,4,3x6 
2x>2l 


median; 
A: D 


>21 
14:6 


>21 
4:0 


>21 

5:0 


>21 

3:2 


>21 

5:0 


>21 
4:1 


>21 
5:0 


>21 
4:0 


4 

2:3 


>21 
4:0 


4 
1:4 


2 

2:36 


P values 


<aoooi 

<0.0001 


0.0002 
00001 


cO.0001 
<0.0001 


O004 

0.0075 


0*0002 
<0.0001 


QJW06 
0.006 


<0-0001 
cO.0001 


O0002 
0.0001 


0.0025 


0JQ002 
0.0001 


om$ 

rtf. 






























Mini Pools 


BLH/U&mV 


HVC44.1 


JD908 


BG9739 


EF5668 


L81905 


DBL5 
3,4.4.4,6x>2t 


DBL6A 


Control 




>21 
18:6 


>21 
5:0 


>21 
8:2 


>21 
4:1 


>21 

9:0 


>21 
6:4 


4 

1:4 


2 

2:36 


P vaiaues 
rank/a:d 


<QJO0Q\ 


<0JXM1 
<0.0001 


<0.0001 


O0006 
0-006 


<0.0001 


0.0004 


0.015 
its. 




Score 
% alive 


72 
71 


100 
100 


80 
79 


75 
74 


100 
100 


60 
58 


+1 
20 
16 


5 
0 


A66 challenge |W*A/Rximti«.i 


JD90B 


BG9739 


EF5668 


181905 


DBL5 


DBL6A 





A66 challenge 


day* of death 


median 
days alive 


alive : 
dead 


P- days to 
death 


P - alive : 
dead 


Score 


alive 


protected 


Ail immune 


2,2.4x3,7x4,5, 
8,l3,50x>21 


>21 


50:16 


<0JX»1 


<0.0001 


++4- 


76 


75 


AU controls 


7x 1,22x2,3x3, 
4,3x6. 2x>21 


■ a~ 


" 1:34 








5 


0 



77)0^3 

1 Pooled Data for Protection against EF10197. by various PspAs | 



Exp. 


CFU 

EF 

10197 


Mice 


Days to Death/ immunogen 


Rxl 
BC100 


JS1020 
(BC97391 


U1905 


JS3020 
(DBL6A) 


EFS668 


I5501O3 FL 
fOBL5)0 


control 


£140 


3.00 


CBA/N 


Sx >21 












20.2 


V41BCC 


2.70 


CBA/N 


• 














£129 


334 


CBA/N 




8. 4x>23 










2X2.2,9 



* This was a passive protection study. Its control* have been included to increase the nuotpcrs ot control mice. 



WO 97/09994 



PCT/US96/14H19 



Pool of Pools for protection agaisnt EF10197 



line 


Group 
Description 


Delay in death and/or survival 


Survival 


days to death (medain) 


P value* tic 


UlV«:deAd 


P values etc 


U 


Rxl (£140) 


5x>21 


O0l7vslb 
0.0W3 vs 4b 


5:0 


0X18 v» lb 
0408 vs 4b 




JS1020 (E129) 


S.4x>23 


0.0007 n 3b 


4:1 


0X24 v* 3b 


4a i 


all immune 


B.9x>21 


<O0001vi4b 


9:1 


0X002vs4b 


lb 


Rxl comroh (E140) 


222 




0:3 




2b 


MIDCC 


22222 




0:5 




3b 


J51020cont(E129) 


222X9 




0:5 




4b 


all control* (without Ml BCC) 


2222222,9 




0:8 







Summ 


ary of protection against EF10197 




alive : dead 


% alive 


% protected 


median DOD 


P time alive 


P alive : dead 


Score' 


Rxl 


5:0 


too 


100 


>21 


0X17 


OJ018 




JS1020 


4:1 


80 


80 


>21 


00007 


OQ24 




aU immune 


9:1 


90 


90 


>21 


<0X001 


0X002 




all controls 


0:8 


0 


0 


2 




e»a> 





Fooled Data for Protection against ATCC6303. by various 
PspAs 



Exp. 


CFU 

ATCC 
6303 


Mice 


Days to Death/ immunogen 


Rxl 
BC1M 


JS1020 
(BC9739) 


LS1905 


JS3020 
(DBL6A) 


EF5668 


JS50NUFL 
(DBL5)0 


control 


El 40 


2.30 


CBA/N 


5x>21 












4.4x5 


E129 


3.80 


CBA/N 




n.v. 













Pool of Pools for protection agaisnt ATCC6303 



line 


Group 
Description 


Delay in death and/or survival 


Survival 


days to death (medain) 


P values etc, 


altv«:de*d 


P values etc 


1* 


Rxl (E140) 


5*>21 (>21) 


0X040 


5:0 


0.004 


lb 


RX1 controls (E140) 


4.4x5 5 




0:5 





Summary of protection against ATCC6303 | 


Inwnunoeen 


alive : dead 


% alive 


X protected 


median POD 


P time alive 


P alive : dead 


Score* 


Rxl 


5:0 


100 


100 


>21 


0.004 


0X04 


+++ 


Rxl controls 
• _ .1.4;. 


0:5 


0 


0 


5 









I Tti&ve V e 



|lo* 



WO 97/09994 



PC17US96/14819 



-Thus 



Fooled Data for Protection against BG9739. by various FL Psp As 


fcxp. 


CFU 

DG9739 


Mice 


Days to Death/ immunogen 


R36A 
FL 


■>a*^4 am 

BC100 
(Rxl) 


JU908 
(WU2) 


[BC9739] 


L 

bclQO 
(BC973 
9) 


U*329o 
FL 


EF5668 
FL 


bclw 
(111905) 


FL 
(OBLS) 


bcltf) 

(DBL5] 


JS3030 
[DBU 
A\ 


control 


E140 


2.76 


COA/N 




3,3,10,12 






















£104 


Z89 


Xid 


















2^3.4 




3 


2A3.5.5 


E125 




CBA/N 










MM 






4,5,13, 
>21 




12X4 




6 


E172 


71 
O./ 1 


CBA/N 






6,7,3x 
>21 


















3.4A6.7 


E124 


376 


Xid 


















2.2.2.2,3 




2.2.2,2, 


i.7.2.7.2 




9 




E084 


4.05 


BALB/c 








4x2, 2x 
>14 
















9x2 


E144 


4.09 


Xid 


1 










>to 












UA3.3 






























































All 






1 


3,3,10,11 


&,7,3x >o; 


4x2,6,6, 
7.8,8, 
2*>2l 




>21 


2^,3.3,4 




7x2,3,3 
4 




8*2,3, 
9 


21x2, 
7x3L3» 
4.3x5. 
3x6.7 








3 


3. 10 


>21 


6 


5 


3 


3 


5, 13 


2 


2 


2 


. 2 


a :d 






1:4 


0:4 


3:2 


2:9 


0:4 


1:4 


0:5 


1:3 


0:10 


0:4 


0:10 


0:38 


P rank 












* 


















Pad 































Pooled Data for Protection against BC9739. by bclOOs and FL PspAs 



Exp. 


CFU 
BG973S 


Mice 


Days to Death/ immunogen 


K36A 
FL 


BC100 
(Rxl) 


JD908 1 
(WU2) 


JS1020! 
[BG9739] 


bclOO 
(BC973 
») 


EF3296 
FL 


EF5668 
FL 


bclOO 
(LS1905) 


JS5010J 

FL 
(DBL5) 


bclOO 
(D8L5) 


;53020 
[DBU 
A) 


coniiol 


E140 


2J6 


CBA/N 




33.10,11 




















2A3 


E104 


7JS9 


Xid 








6,6.7,8.8 










2XU.4 




3 




E125 


356 


CBA/N 










5.5,57 






4A13, 

>21 




2A2.4 




L3,4,4> 
6 


E172 


3.71 


CBA/N 






6.7,3x 
>21 


















3,46A7| 


E124 


3.76 


Xid 






















2.2X2, 
9 


tVYaTC 






E084 


4.05 


BALB/c 








4x2, 2x 
>14 
















9x2 


E144 


4.09 


Xid 


2A3A 
>21 










U.3,7. 
>10 


2JM4 












FL + bclOCBG9739 


R36A/F 


Lxl/039 


WL2 


BG9 




EF3296 


EFS468 


L81905 


DS 


IS 


DBL6A 


Cont 


Ail 


L4x3,«,iail*>21 


6.7,3x 
>21 


4x2,3x5,2x6, 
2x7,2x8. 2x>2l 


>21 


2^x3,4 


4A13. 
>21 


10x2.3,3,4,4 


8x2,3. 
9 


21x2. 
7xX3x 
13x3. 
3x6.7 


median day* alive 


3 


>21 


« 


3 


3 


3,13 


2 


2 


2 


alive ; dead 


l.a 


3:2 


2:13 


1:4 


0:5 




0:14 


0:10 


0:38 


P - da vi alive 


0.0096 


<0.0001 


0.0013 


fix. 


IU. 


0.0022 


n.a. 






P- abve : dead 


TLX. 


0.0008 


in 


ax. 


A*. 


nx- 


ILL 


nj. 




Score 


♦ 




+ 


0> 


0 


♦t 


0 


0 




% attve 


11 


60 


13 


25 


0 


25 


0 


0 


0 


% p rot acted 


11 


60 


13 


25 


0 


25 


0 


0 


0 


BG9739 challenge 


R36A/Rxl/D39 


WU2 


BG9739 


EF3296 


EF5668 


L81905 


DBL5 


DBL6A 


Cont 



WO 97/09994 



PCT/US96/14819 



BC9739 


dayso/ 
death 


median 
daysof 
death 


aim 
deaa 


P value 
based on 
days to 
death 


lvalue 
based on 
alive : 
dead 


£core 


% Alive 


% 








3 


05 




6m' 


♦+ 


12 


12 




All controls 




2 


0:3i 












E84 1 
E14D 
E104 



E124 
£172 



CFU 

EF3296 



3.99 
2.92 



3.U 
3.94 
4.06 



BALB/c 
CBA/N 



CBA/N 
CBA/N 



CDA/N 



All 



median days to death 



alive : dead 
P - days to death 



r 



Rxl 
BC100 



3,4.6,>21 



3,4,6,>21 



1:3 
0.0077 



EJF3296 
challenge 



Rxl 
BCXU 



Days to Death/ immunogen 



JD908 
WU2 



JS1020 
(BC9739) 



JS5010JFL 
(DBL5) 



4*2,>14 



3333,>21 



1:4 



0.0094 




J DM8 

VVU2 



4x2.4.3x5, 
6. >2l 



43 



1:9 



n.j. 



JS1G2Q 
(BG9739) 



20333 



1,1333 



1,13x2,33 



0:9 



IS501O3FL 
(D0L5) 



JS3020 
fDBUA) 



203,43 



U333 



1,13x2.3, 
4.5 



1,1,15x2,3x3 
4.4x6 



0:10 



JSM20 
(DDL* A) 



control 



9x2 



333 



? 7.7.T4 



3.4x6 



0:27 



control 



EF3296 I median 
challenge |day» alive 


alive : 
dead 


P - days 
to death 


P -alive : 
dead 


Score 


% 
alive 


% 

prat 


All immune 


3 


3:35 


in. 


1L1. 


0 


8 


B 




2 












hoi. 



WO 97/09994 



PCT/US96/14819 



Summary of protection against EF6796 


Inunui MMtui 


alive : dead ( % alive 


% protected 


median OOO 


P -lime ative 


P alive va dead 


Rxl 


4:0 1 100 


100 


>21 


0.029 


0.029 


control! 
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EXAMPLE 8 - Ability of PspA inununogens to protect against 

individual challenge strains 

In example 7 some of the capsular type 2, 4, and 5 
strains were not completely protected from death by immunization. 
In these studies the BALB/cByJ mouse was used instead of the 
hypersusceptible, immunodef icient CBA/N mouse used for the 
Example 7 studies* With the BALB/cJ mouse it was observed that 
immunization with PspA was in fact able to protect against death 
with capsular type 2, 4, and 5 pneumococci. This result is shown 
in the table below. 

The data from Table 60A also demonstrates that a 
mixture of 4 - 5 full length PspAs was as effective, or more 
effective than immunization with a single PspA* 
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EXAMPLE 9 - CHARACTERIZATION OF PspA EPITOPES WITHIN 

PNEUMOCOCCAL fi TO »T M8 M g„., fl C " WITHIN 

The strains examined came from a group of 13 capsular 
serotype 6B strains which have been identified that are members 
of a multiresistant clone, having resistance to penicillin, 
chloramphenicol, tetracycline, and some have acquired resistance 
to erythromycin. The pneumococcal isolates described in the 
following studies (MC25-28) are members of this 6B clone. 
Although previously thought to be geographically restricted to 
Spain (unlike the widespread multiresistant Spanish serotype 23F 
clone) , members of this clone have been shown to be responsible 
for an increase in resistance to penicillin in Iceland (Scares, 
S., et al., J. Infect. Dis. 1993; 168 ; 158-163). 

The following techniques were used to characterize the 
location of difference PspA epitopes: 

Bacterial cell culture. Bacteria were grown in Todd-Hewitt broth 
with 0.5% yeast extract or on blood agar plates overnight at 37-c 
in a candle jar. Capsular serotype was confirmed by cell 
agglutination using Danish antisera (Statens Seruminstitut, 
Copenhagen, Denmark) . The isolates were subtyped as 6B by 
Quellung reaction, utilizing rabbit antisera against 6A or 6B 
capsule antigen. 

Bacterial lysates. cell lysates were prepared by incubating the 
bacterial cell pellet with 0.1% sodium deoxycholate, 0.01% sodium 
dodecylsufate (SDS) , and 0.15 M sodium citrate, and then diluting 
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the lysate in 0.5M Tris hydrochloride (pH 6.8). Total 
pneumococcal protein in the lysates was quantitated by the 
bicinchoninic acid method (BCA Protein Assay Reagent; Pierce 
Chemical Company, Rockford, IL) . 

PspA s&rotyping. Pneumococcal cell lysates were subjected to 
SDS-PAGE, transferred to nitrocellulose membranes, and developed 
as Western blots using a panel of seven MAbs to PspA. PspA 
serotypes were assigned based on the particular combination of 
MAbs with which each PspA was reactive. 

Colony immunoblotting . A ten mL tube of Todd-Hewitt broth with 
0.5% yeast extract was inoculated with overnight growth of MC25 
from a blood agar plate. The isolate was allowed to grow to a 
concentration of 10 7 cells/mL as determined by an O.D. of 0.07 at 
590nm. MC25 was serially diluted and spread-plated on blood agar 
plates to give approximately 100 cells per plate. The plates 
were allowed to grow overnight in a candle jar, and a single 
blood agar plate with well-defined colonies was selected. Four 
nitrocellulose membranes were consecutively placed on the plate. 
Each membrane was lightly weighted and left in place for 5 min. 
In order to investigate the possibility of phase-variation 
between the two proteins detected on Western blots a single 
colony was picked from the plate, resuspended in ringer's 
solution, and spreadplated onto a blood agar plate. The 
membranes were developed as Western blots according to PspA 
serotyping methods. 
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When the strains MC25-28 were examined with the panel 
of seven MAbs specific for different PspA epitopes, all four 
demonstrated the same patterns of reactivity (Fig, 14) . The MAbs 
X1R278 and 2A4 detected a PspA molecule with an apparent 
molecular weight of 190 kDa in each isolate. In accordance with 
the PspA serotyping system, the 190 kDa molecule was designated 
as PspA type 6 because of its reactivity with XiR278 and 2A4, but 
none of the five other MAbs in the typing system. Each isolate 
also produced a second PspA molecule with an apparent molecular 
weight of 82 kDa. The 82 kDa PspA of each isolate was detected 
only with the MAb 7D2 and was designated as type 34 ♦ No 
reactivity was detected with MAbs Xil26, Xi64, 1A4 , or SR4Wr. 
Results from the colony immunoblotting showed that both PspAs 
were present simultaneously in these isolates under in vitro 
growth conditions. All colonies on the plate, as well as all of 
the progeny form a single colony, reacted with MAbs XiR278, 2A4, 
and 7D2. 

EXAMPLE 10 - SOUTHERN BLOT ANALYSIS OP CHROMOSOMAL DNA ISOLATED 

FROM PNEUMOCOCCAL STRAINS MC25-28 

Pneumococcal chromosomal DNA was prepared by the 

Youderian method (Sheffield, J.S. ( et al., Biotechniques, 1992; 

12: 836-839) . Briefly, for a 500 ml culture in THY or THY with 

1% choline, cells were centrifuged at 8000 rpm in GSA rotor for 

30 minutes at 4°C* The supernatant was decanted, and the cells 

were washed with 1 to 2 volumes of sterile water to remove 



114 



WO 97/09994 



PCT/US96/14819 



choline, if used. This step was only necessary when sodium 
deoxycholate was used. The wasted cells were centrifued twice a 
8000 rpm in GSA rotor for 10 minutes. Cells were resuspended in 
3.5 ml TE buffer, containing 1% SDS or 1% sodium deoxycholate, 
and incubated at 37 °c for 15 minutes if sodium deoxycholate was 
used. If SDS was used, incubation at 37 °C was not necessary. 
The cells were incubated at 65°C for 15 minutes, and 1/5 volume 
of 5.0 M potassium acetate was added, and the cell suspension was 
incubated for 3 0 minutes at 65 °C. 

The cells were placed on ice for 60 minutes, and 
centrifuged at 12,000 rpm in an SS-34 rotor for 10 minutes. The 
supernatant was transferred to a clean centrifuge tube, and 2 
volumes of cold 95% ethanol was added. After mixing, DNA was 
spooled on to a glass pasteur pipet, and air dried. The DNA was 
resuspended in 4 ml TE, and 4.0 g cesium chloride was added. The 
solution was split into two aliquots in ultracentrifuge tubes, 
and the tubes were filled to their maximum capacity using 
1.0 g/ml cesium chloride in TE. Before closing the tubes, 300 ml 
of 10 ug/ml ethidium bromide was added. 

The solution was centrifuged at 45,000 rpm overnight, 
or for 6 hours at 55,000 rpm. The chromosomal band was extracted 
using a gradient, at least 6 times with 1 volume each salt- 
saturated isopropanol. The aqueous phase was extracted by adding 
2 volumes 95% ethanol. The DNA came out of solution immediately, 
and it was spooled on to a pasteur pipet. The DNA pellet was 
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washed by dipping the spooled DNA in 5 ml 70% ethanol. The DNA 
was air dried, and resuspended in the desired volume of TE, e.g., 
500 ul. 

The cells were harvested, washed, lysed, and digested 
with 0.5% (st/vol) SDS and l00Mg/mL proteinase K at 37 °c for 1 h. 
The cell wall debris, proteins, and polysaccharides were 
complexed with 1% hexadecyl trimethyl ammonium bromide (CTAB) and 
0.7M sodium chloride at 65°C for 20 min., and then extracted with 
chloroform/ isoamyl alcohol* DNA was precipitated with 0.6 
volumes isopropanol, washed, and resuspended in lOmM Tris-HCl, 
lmM EDTA, pH 8.0. DNA concentration was determined by 
spectrophotometry analysis at 260 nm (Meade, H.M. et al., J. 
Bacteriol 1982; 149 : 114-122; Silhavy, T.J. et al., Experiments 
with Gene Fusion , Cold Spring Harbor: Cold Spring Harbor 
Laboratory, 1984; and Murray, M.G. , et al., Nucleic Acids Res. 
1980; 8 4321-4325) . 

ProbB preparation. 5' and 3' oligonucleotide primers homologous 
with nucleotides to 26 and 1967 to 1990 of Rxl pspA (LSM13 and 
LSM2, respectively) were used to amplify the full length pspA and 
construct probe LSMpspA13/2 from Rxl genomic DNA. 5' and 3' 
oligonucleotide primers homologous to nucleotides 161 to 187 and 
nucleotides 1093 to 1117 (LSM12 and LSM6, respectively) were used 
to amplify the variable a-helical region to construct probe 
LSMpspA12/6. PCR generated DNA was purified by Gene Clean 
(BiolOl Inc., Vista, CA) and random prime-labeled with 
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digoxigenin-ll-dUTP using the Genius l Nonradioactive DNA 
Labeling and Detection Kit as described by the manufacturer 
(Boehringer Mannheim, Indianapolis, IN) . 

DNA electrophoresis. For Southern blot analysis, approximately 
lOjig of chromosomal DNA was digested to completion with a single 
restriction endonuclease (Hind III, Kpn 1, EcoRI, Dra I, or Pst 
I), then electrophoresed on a 0.7% agarose gel for 16-48 h at 35 
volts. For PCR analysis, 5/uL of product were incubated with a 
single restriction endonuclease (Bel 1, BamH I, Bst I, Pst I, Sac 
I, EcoR I, Sma I, and Kpn I), then electrophoresed on a 1.3% 
agarose gel for 2-3 h at 90 volts. In both cases, 1 kb DNA 
ladder was used for molecular weight markers (BRL, Gaithersburg, 
MD) , and gels were stained with ethidium bromide for 10 min and 
photographed with a ruler. 

Southern blot hybridization. The DNA in the gel was depurinated 
in 0.25N HC1 for 10 min, denatured in 0.5M NaOH and 1.5M NaCl for 
30 min, and neutralized in 0.5M Tris-HCl (pH 7.2), 1.5M NaCl and 
lmM disodium EDTA for 30 min. DNA was transferred to a nylon 
membrane (Micron Separations INC, MA) using a POSIBLOT pressure 
blotter (Stratagene, LaJolla, CA) for 45 min and fixed by UV 
irradiation. The membranes were prehybridi2ed for 3 h at 42 °C in 
50% formamide, 5X SSC, 5X Denhardt solution, 25mM sodium 
phosphate (pH 6.5), 0.5% SDS, 3% (wt/vol) dextran sulfate and 
500/xg/mL of denatured salmon sperm DNA. The membranes were then 
hybridized at 42 °C for 18 h in a solution containing 45% 
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formamide, 5X SSC, IX Denhardt solution, 20mM sodium phosphate 
(pH 6.5), 0.5% SDS, 3% dextran sulfate, 250/ig/mL denatured 
sheared salmon sperm DNA, and about 20ng of heat-denatured 
digoxigenin-labeled probe DNA. After hybridization, the 
membranes were washed twice in 0.1% SDS and 2X SSC for 3 min at 
room temperature* The membranes were washed twice to a final 
stringency of 0.1% SDS in 0.3X SSC at 65°C for 15 min. This 
procedure yielded a stringency greater than 95 percent. The 
membranes were developed using the Genius 1 Nonradioactive DNA 
Labeling and Detection Kit as described by the manufacturer 
(Boehringer Mannheim, Indianapolis, IN) . To perform additional 
hybridization with other probes, the membranes were stripped in 
0.2N NaOH/0.1% SDS at 40°C for 30 min and then washed twice in 2X 
SSC. 

PCR. 5' and 3' primers homologous with the DNA encoding the N- 
and C-terminal ends of PspA (LSM13 and LSM2, respectively) were 
used. Reactions were conducted in 50/iL volumes containing 0.2mM 
of each dNTP, and 1m1» of each primer at a working concentration 
of 50mM. MgCl 2 was used at an optimal concentration of l.75mM 
with 0.25 units of Tag DNA polymerase. Ten to thirty ng of 
genomic DNA was added to each reaction tube* The amplification 
reactions were performed in a thermal cycler (M.J. Research, 
Inc.) using the following three step program: Step 1 consisted 
of a denaturing temperature of 94 °C for 2 min; Step 2 consisted 
of 9 complete cycles of a denaturing temperature of 94 °C for 1 
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min, an annealing temperature of 50 °C for 2 min, and an extension 
temperature of 72 °C for 3 min; Step 3 cycled for 19 times with a 
denaturing temperature 94 °c for l min, an annealing temperature 
of 60 *C for 2 min, and an extension temperature of 72 °C for 3 
min; and at the end of the last cycle, the samples were held at 
72 °C for 5 min to ensure complete extension. 

Band size estimation. Fragment sizes in the molecular weight 
standard and in the Southern blot hybridization patterns were 
calculated from migration distances. The standard molecular 
sizes were fitted to a logarithmic regression model using Cricket 
Graph (Cricket Software, Malvern, PA) . The molecular weights of 
the detected bands were estimated by entering the logarithmic 
line equation obtained by Cricket Graph into Microsoft Excel 
(Microsoft Corporation, Redmond, WA) in order to calculate 
molecular weights based on migration distances observed in the 
Southern blot. 

Since most strains contain a pspA gene and a pspC gene, 
it was expected that if an extra gene were present one might 
observe at least three pspA homologous loci in isolates MC25-28. 
In Hind III digests of MC25-28 each strain revealed 7.7 and 3.6 
kb bands when probed with LSMpspA13/2 (Figure 15A and 15C) . In 
comparison, when Rxl DNA was digested with Hind III and 
hybridized with LSMpspA13/2, homologous sequences were detected 
on 9.1 and 4.2 kb fragments, as expected from previous studies 
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with PspA (Figure 15A) . Results consistent with two pspA- 
homologous genes in MC25-28 were obtained with two pspA- 
homologous genes in MC25-28 digested using four additional 
enzymes (Table 61) . 

Table 61. Chromosomal RFLPs with probe LSMpspA13/2 for isolates MC25-28 and 



?nz^ Cti ° n St " inS Examined Restriction Fragments 
1 — — _ (sizes in kilobases) 

MC25 MC26 MC27 MC28 RX1 MC25-28 Rx a 

Hind III + + + + + 

Kpn I + + + + + 

EcoR 1 + 



Dra I + + 

Pst I + 



+ 



7 - 7 * 3.6 9.1, 4*2 
11.6, 10.6 10.6, 9.8 

8 - 4, 7.6 7.8, 6.6 
2-1* 1-1 1.9, 0.9 

6.1 10.0, 4.o 



The four isolates examined are all members of a single 
clone of capsular type 6B pneumococci isolated from Spain. These 
four isolates are the first in which two PspAs have been 
observed, i.e., PspA and PspC, based on the observation that 
bands of different molecular weights were detected by different 
MAbs to PspA. Mutation and immunochemistry studies have 
demonstrated that all of the different sized PspA bands from Rxl 
are made of a single gene capable of encoding a 69kDa protein, 
supporting the assertion that two PspAs have been observed, i.e., 
PspA and PspC. 

It has been observed that probes for the 5' half of 
pspA (encoding the a-helical half of the protein) bind the pspC 
sequence of most strains only at a stringency of around 90%. 
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With chromosomal digests of MC25-28, it was observed that the 5' 
Rxl probe LSMpspA12/6 (Figure 15D) bound two pspA homologous 
bands at even higher stringency . The same probe bound only the 
pspA containing fragment of Rxl at the higher stringency (Figure 
15B) . 

Further characterization of the pspA gene was done by 
RFLP analysis of PCR amplified pspA from each strain. Since 
previous studies indicated that individual strains yielded only 
one product, and since the amplification was conducted with 
primers based on a known pspA sequence, it was assumed that the 
product amplified from each strain represented the pspA rather 
than the pspC gene. When MC25-28 were subjected to this 
procedure, an amplified pspA product of 2.1 kb was obtained from 
each of the four strains. When digested with Hha I, this 
fragment yielded bands of l.l, 0.46, 0.21 and 0.19 kb for each of 
the four isolates. A single isolate, MC25, was analyzed with 
eight additional enzymes. Using each restriction enzyme, the sum 
of the fragments was always approximately equal to the size of 
whole pspA (Figure 16). These results suggested that the 2.1 kb 
amplified DNA represents the amplified product of only a single 
pspA gene. Rxl produced an amplified product of 2.0 kb and five 
fragments of 0.76. 0.468, 0.390, 0.349 and 0.120 kb when digested 
with Hha I as expected from its known pspA sequence. 

There are several possible explanations for the 
observation of PspA and PspC in these strains but not in other 
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strains. All isolates might make PspA and PspC in culture, but 
MAbs generally recognize only PspA (perhaps, in this isolate 
there has been a recombination between pspC DNA and the pspC 
locus, allowing that locus to make a product detected by MAb to 
PspA) . All isolates can have PspA and PspC, but the expression 
of one of them generally does not occur under in vitro growth 
conditions. The pspC locus is normally a nonfunctional 
pseudogene sequence that, for an unexplained reason, has become 
functional in these isolates. Results from the colony 
immunoblotting of these isolates failed to show a detectable in 
vitro phase shift between either PspA type 6 (XIR278 and 2A4) or 
PspA type 34 (7D2) protein. This strengthens the second 
explanation, and suggests that the second PspA is these isolates 
is due to the pspC gene not being turned off during in vitro 
growth conditions* 

Presumably, in these four strains, the second PspA 
protein is provided by the pspC DNA sequence. At high 
stringency, the probe comprising the coding, region of the a- 
helical half of PspA recognized both pspA homologous sequences of 
MC25-18, but not the pspC sequence of Rxl. The finding indicated 
that the pspC sequence of MC25-28 is more similar to the Rxl pspA 
sequence than the Rxl pspC sequence. If the pspC sequence of 
these strains is more similar to pspA than most pspC sequences, 
it could explain why the products of pspC genes cannot generally 
be identified by MAbs. 
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Example 11 - Identification of conserved and variable regions 

of pspA and pspC sequences of S. pneumoniae . 

The S. pneumoniae strains used in this study are listed 
in Table 62* The strains are human clinical isolates 
representing 12 capsular and 9 PspA serotypes. All strains were 
grown at 37 °C in 100ml of Todd-Hewitt broth supplemented with 
0.5% yeast extract to an approximate density of 5 x 10 8 cells/ml ♦ 
After harvesting of the cells be centrif ugation (2900 g, lOmin) , 
the DNA was isolated, and stored at 4°C in TE (10 mM Tris, ImM 
EDTA, pH8.0) . 
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Table 62. Streptococcus pneumoniae strains used. 



Strain Relevant phenotype Reference 

WU2 Capsular type 3, PspA type 1 Briles et al., 1981 

D39 Capsular type 2, PspA type 25 Avery et al., 1944 

R36A Nonencapsuated mutant of D39, Avery et al. # 1944 

pspA type 25 

Rxl Derivatitve of R36A, PspA type 25 Shoemaker and Guild, 1974 

DBL5 Capsular type 5, PspA type 33 Yother et al., 1986 

DBL6A Capsular type 6A, PspA type 19 Yother et al., 1986 

* 

A66 Capsular type 3, PspA type 13 Avery et al., 1944 

AC94 Capsular type 9L, PspA type 0 Waltman et al., 1992 

AC17 Capsular type 9L, PspA type 0 Waltman et al., 1992 

AC40 Capsular type 9L, PspA type 0 Waltman et al., 1992 

AC107 Capsular type 9V, PspA type 0 Waltman et al., 1992 

AC100 Capsular type 9V, PspA type 0 Waltman et al., 1992 

AC140 Capsular type 9N, PspA type 18 Waltman et al., 1992 

D109-1B Capsular type 23, PspA type 12 KcDaniel et al-, 1992 

BG9709 Capsular type 9, PspA type 0 McDaniel et al., 1992 



L81905 Capsular 
L82233 Capsular 
L82006 Capsular 



type 4, PspA type 25 
type 14, PspA type 0 
type 1, PspA type 0 



McDaniel et al., 1992 
McDaniel et al., 1992 
McDaniel et al., 1992 



Approximately 5jig of chromosomal DMA was digested with 
HindlTI according to the manufacturer's instructions (Promega r 
Inc., Madison, WI) . The digested DNA was subjected to 
electrophoresis at 35 mV overnight in 0-8% agarose gels and then 
vacuum-blotted onto Nytran® membranes (Schleicher & Schuell, 
Keene, NH) . 
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The oligonucleotides uses were based on the previously 
determined sequence of Rxl pspA. Their position and orientation 
relative to the structural domains of Rxl pspA are shown in 
Figure 17* Labeling of oligonucleotides and detection of probe- 
target hybrids were both performed with the Genius System® 
according to manufacturer's instructions (Boehringer-Mannhein, 
Indianapolis, IN) . All hybridizations were done for 18 hours at 
42 °C without formamide. By assuming that 1% base-pair 
mismatching results in a 1°C decrease in T m arbitrary 
designations of "high" and "low" stringency were defined by salt 
concentration and temperature of post-hybridization washes. 
Homology between probe and target sequences was derived using 
calculated T^ by established methods. High stringency is defined 
as £ 90%, and low stringency is < 85% base-pair matching. 

PCR primers, which were also used as oligonucleotide 
probes in Southern blotting and hybridizations, were designed 
based on the sequence of pspA from pneumococcal strain Rxl. 
These oligonucleotides were synthesized by Oligos, Etc. (Wilson, 
OR), and are listed in Table 63. 
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Table 63 • Oligonucleotide sequences. 



Primer 


5' -> 3' 


LSM111 


CCGGATCCAGCTCCTGCACCAAAAC 


LSH2 


GCGCGTCGACGCTTAAACCCATTCACCATTGG 


LSM3 


CCGGATCCTGAGCCAGAGCAGTTGGCTG 


LSM4 


CCGGATCCGCTCAAAGAGATTGATGAGTCTG 


LSM5 


GCGGATCCCGTAGCCAGTCAGTCTAAAGCTG 


LSM6 


CTGAGTCGACTGGAGTTTCTGGAGCTGGAGC 


LSM7 


CCGGATCCAGCTCCAGCTCCAGAAACTCCAG 


LSM9 


GTTTTTGGTGCAGGAGCTGG 


LSMXO 


GCTATGGCTACAGGTTG 


LSM12 


CCGGATCCAGCGTGCCTATCTTAGGGGCTGGT 


LSM112 


GCGGATCCTTGACCAATARKRACGGAGGAGGC 



PCR was done with an MJ Research, Inc. , Programmable 
Thermal Cycler (Watertown, MA) , using approximately 10 ng of 
genomic pneumococcal DNA as template with designated 5' and 3' 
primer pairs. The sample was brought to a total volume of 50 /il 
containing a final concentration of 50 mM KC1, 10 mM Tris-HCl (pH 
8.3), 1.5mM MgCl 2 r 0.01% gelatin, 0.5 /*M of each primer, 200 /iM 
of each deoxynucleoside triphosphate, and 2.5 U of Tag DNA 
polymerase. The samples were denatured at 94°C for 2 minutes and 
subjected to 10 cycles consisting of: 1 min at 94°C, 2 min at 
50°C, and 3 min at 72°C, followed by 20 cycles of: 1 min at 94°C, 
2 min at 60°C, and 3 min at 72°C. After 30 total cycles, the 
samples were held at 72°C for an additional 5 min prior to 
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cooling to 4°C. The amplicons were then analyzed by agarose gel 
electrophoresis . 

Oligonucleosides were used to probe Hindlll digests of 
DNA from 18 strains of S. pneumoniae under conditions of low and 
high stringency. Each strain was also screened using a full- 
length pspA probe. Table 64 summarizes the results for each 
strain under conditions of high stringency. Strain Rxl is a 
laboratory derivative of the clinical isolate D39 and 
consequently, both strains showed identical hybridization 
patterns and are a single column in Table 64. 
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The only strain which did not have more than one pspA- 
homologous Hindlll fragment was WU2, which was previously shown 
using a full-length pspA probe. Even at high stringency, six of 
the eight probes detected more than one fragment in at least one 
of the 18 strains (Table 64). L5M7, 10 and 12 hybridized with 
two fragments in more than one-half of the strains, and the 
fragments detected by the oligonucleotide probes were identical 
in size to those detected by the full-length pspA probe. 
Moreover, the same pairs of fragments were frequently detected by 
probes derived from the 3' as well as the 5' region of Rxl pspA. 
These results suggested that the Hindlll fragments from different 
isolates include two separate but homologous sequences, rather 
than fragments of a single pspA gene. Based on the diversity of 
the hybridization patterns and the size of restriction fragments, 
it is clear that pspA and pspC sequences are highly diverse and 
that these loci have considerable sequence variability as 
determined by location of Hindlll recognition sites. 

Oligonucleotides which hybridize with a single 
restriction fragments in each strain were assumed to be specific 
for pspA. At high stringency, LSM3 and LSM4 detected only a 
single Hindlll fragment in the strains with which they reacted. 
Restriction fragments containing homology to LSM3 or LSM4 were 
the same as those which hybridize with all of the other 
homologous probes. This suggested that LSM3 and LSM4 
specifically detect pspA rather than the pspC sequence. That 
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LSM3 hybridizes with a single restriction fragment of WU2 further 
confirmed that this oligonucleotide is specific for pspA. 
Sequences from the portion of the gene encoding the second 
proline region (LSM111) and the C-terminus (LSM2) appeared to be 
relatively specific for pspA since they generally detect only one 
of the Hindlll fragments of each strain* 

Oligonucleotides LSM12 and LSM10 were able to detect 
the most conserved epitopes of pspA and generally hybridize with 
multiple restriction fragments of each strain (Table 65) . LSM7 
was not as broadly cross-reactive , but detected two pspAs in 41% 
of strains including almost 60% of the strains with which it 
reacts- Thus, sequences representing the leader, first proline 
region, and the repeat region appear to be relatively conserved 
not only within pspA but between the pspA and pspC sequences* 
LSM3, 4, and 5 hybridize with the smallest number of strains of 
any oligonucleotides (29-35 percent) , suggesting that the a- 
helical domain is the least conserved region within pspA. In 
strains BG58C and L81905 oligonucleotides detect more than two 
Hindlll fragments containing sequences with homology to pspA. 
Because of the absence of Hindlll restriction sites within any of 
the oligonucleotides it was unlikely that these multiple 
fragments result from the digestion of chromosomal DNA within the 
target regions. Also, the additional restriction fragments were 
detected at high stringency by more than one oligonucleotide* 
Possibly, in these two strains, there are three or four sequences 
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with DNA homology to some portions of pspA. The probes roost 
consistently reactive with these additional sequences are those 
for the leader, the alpha-helical region, and the proline-rich 
region . 

The oligonucleotides used as hybridization probes were 
also tested for their utility as primers in the polymerase chain 
reaction (PGR) . Amplification of pspA from 14 strains of S. 
pneumoniae comprising 12 different capsular types was attempted 
with the primers listed in Table 63. LSM2, derived from the 3' 
end of pspA, were able to amplify an apparent pspA sequence from 
each of 14 pneumococcal strains when used in combination with 
LSM111, which is within the sequence of pspA encoding the 
proline-rich region. Combinations of LSM2 with primers upstream 
in pspA were variably successful in amplifying sequences (Table 
65) , The lowest frequency of amplification was observed with 
LSM112 which was derived from the Rxl sequence 5' to the pspA 
start site. This oligonucleotide was not used in the 
hybridization studies. DNA fragments generated by PCR were 
blotted and hybridized with a full-length pspA probe to confirm 
homology to pspA* 

Further evidence for variability at the pspA locus 
comes from the differences in the sizes of the amplified pspA 
gene. When PCR primers LSM12 and LSM2 were used to amplify the 
entire coding region of PspA, PCR products from different 
pneumococcal isolates ranged in size from 1.9 to 2.3 kbp. The 
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regions of pspA which encode the a-helical, proline-rich f and 
repeat domains were amplified from corresponding strains and 
variation in pspA appears to come from sequences within the a- 
helical coding region. 

Table 65. Amplification of psph by PGR using the indicated 

oligonucleotides as 5' primers in combination with the 
3' - primer L8M2* 



5' - primer 


Domain 


Amplified/ 
Tested 


Percent 
Amplified 


LSM112 


-35 (upstream) 


2/14 


14 


LSM12 


leader 


8/14 


57 


LSM3 


a-helical 


3/14 


21 


LSM7 


proline 


12/14 


86 


LSM111 


proline 


14/14 


100 



These studies have provided a finer resolution map of 
the location of conserved and variable sequences within pspA. 
Additionally, regions of divergence and identity between pspA and 
the pspC sequences have been identified* This data confirmed 
serological studies, and demonstrated that pspA and pspC 
sequences are highly variable at the DNA sequence level. The 
diversity of Hindlll restriction fragment polymorphisms contained 
pspA and the pspC sequence supported earlier data using larger 
probes that detected extensive variability of the DNA in and 
around these sequences. 
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A useful pspA-specif ic DNA probe would identify Rxl and 
WU2 pspA genes, in which restriction maps are known, and would 
identify only a single restriction fragment in most strains. Two 
probes, LSM3 and LSM4, do not hybridize with more than one 
Hindlll restriction fragment in any strain of pneumococcus . Both 
of these oligonucleotides hybridize with Rxl pspA and LSM3 
hybridizes with WU2 pspA. However, each of these probes 
hybridize with only four of the other 15 strains. When these 
probes identify a fragment, however, it is generally also 
detected by all other Rxl-derived probes. Oligonucleotides from 
the second proline-rich region (LSM111) and the C-terminus of 
pspA (LSM2) generally identify only one pspA-homologous sequence 
at high stringency. Collectively, LSM111, 2, 3 and 4 react with 
16 of the 17 isolates and in each case revealed a consensus DNA 
fragment recognized by most or all of the oligonucleotide probes. 

When an oligonucleotide probe detected only a single 
DNA fragment it was presumed to be pspA* If the probe detected 
multiple fragments, it was presumed to hybridize with pspA. If 
the probe detected multiple fragments, it was presumed to 
hybridize with pspA and the pspC sequence. Based on these 
assumptions the most variable portion between pspA and pspC is 
the region immediately upstream from the -35 promoter region and 
that portion encoding the ot-helical region. The most conserved 
portion between pspA and pspC was found to be the repeat region, 
the leader and the proline-rich region sequences. Although only 
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one probe from within the repeat region was used, the high degree 
of conservation among the 10 repeats in the Rxl sequence makes it 
likely that other probes within the repeat sequences would give 
similar results. 

The portion of Rxl pspA most similar to the pspC 
sequence was that encoding the leader peptide, the upstream 
portion of the proline-rich region, and the repeat region. The 
repeat region of PspA has been shown to be involved in the 
attachment of this protein to the pneumococcal cell surface. The 
conservation of the repeat region within pspC sequences suggests 
that if these loci encode a protein, it may have a similar 
functional attachment domain. The conservation of the leader 
sequence between pspA and the pspC sequence was also not 
surprising since similar conservation has been reported for the 
leader sequence of other proteins from gram positive organisms, 
such as M protein of group A streptococci (Haanes-Fritz, E. et 
al., Nucl. Acids Res. 1988; 16: 4667-4677). 

In two strain, some oligonucleotide probes identified 
more than two pspA-homologous sequences. In these strains, there 
was a predominant sequence recognized by almost all of the 
probes, and two or three additional sequences share homology with 
DNA encoding the leader, cr-helical, and proline region, and they 
have no homology with sequences encoding the repeat region in the 
C-terminus of PspA. These sequences might serve as cassettes 
which can recombine with pspA and/ or the pspC sequences to 
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generate antigenic diversity * Alternatively, the sequences might 
encode proteins with very different C-terminal regions and might 
not be surface attached by the mechanism of PspA. 

Oligonucleotides which hybridize with a single 
chromosomal DNA fragment were used as primers in PCR to examine 
the variability of domains within pspA. These results 
demonstrate that full-length pspA varies in size among strains of 
pneumococci, and that this variability is almost exclusively the 
result of sequences in the alpha-helix coding region. 
Example 12 - Cloning of PspC 

Chromosomal DNA from S. pneumoniae EF6796, serotype 6A 
clinical isolate, was isolated by methods including purification 
through a cesium chloride gradient, as described in Example 8. 
The Hindll-EcoRI fragment of EF6796 was cloned in modified pZero 
vector (Invitrogen, San Diego, CA) in which the Zeocin-resistance 
cassette was replaced by a kanamycin cassette (shown in Figure 
18) . Recombinant plasm ids were electroporated into Escherichia 
coli TOP10F' cells [F' {lad^Tet*} mcrA A (mrr-hsdRMS-mcrBC) 
<f>801acZAM15 dlacX74 deoR recAl araD139 A (ara-leu) 7967 galU galK 
rpsL endAl nupG] (Invitrogen) • 

The 5' region of pspA.Rxl does not hybridize to pspC 
sequence at high stringencies by Southern analysis. Utilizing 
both the full-length Rxl pspA probe, and a probe containing the 
sequence encoding a-helical region of PspA, it was possible to 
identify which DNA fragment contained pspA and which fragment 
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contained the pspC locus- The pspC locus and the pspA gene of 
EF6796 were mapped using restriction enzymes. After digestion of 
chromosomal DNA with Hindlll, the pspC locus was localized to a 
fragment of approximately 6.8 kb. Following a double digest with 
Hindlll and EcoRI, the pspC locus was located in a 3.5 kb 
fragment. To obtain the intact pspC gene of EF6796, chromosomal 
DNA was digested with tfindlll, separated by agarose gel 
electrophoresis , the region between 6 and 7.5 kb purified, and 
subsequently digested with EcoRI. This digested DNA was analyzed 
by electrophoresis, and DNA fragments of 3.0 to 4.0 kb were 
purified (GeneClean, BiolOl, Inc., Vista, CA) . The size- 
fractionated DNA was then ligated in tfindlll-EcoRI-digested 
pZero, and electroplated into E. coli TOP10F' cells. Kanamycin- 
resistant transf ormants were screened by colony blots and probed 
with full-length pspA. A transf ormant, LXS200, contained a 
vector with a 3.5 kb insert which hybridized to pspA. 

Sequencing of pspC in pLXS200 was completed using 
automated DNA sequencing on an ABI 377 (Applied Biosystems r Inc., 
PLACE) . Sequence analyses were performed using the University Of 
Wisconsin Genetics Computer Group (GCG) programs supported by the 
Center for AIDS Research (P30 AI27767) , MacVector 5.0, Sequencer 
2.1, and DNA Strider programs. Sequence similarities of pspC 
were determined using the NCBI BLAST server. The coiled-coil 
structure predicted by pspC sequence was analyzed using Matcher. 
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A gene probe for cloning the pspC locus. Two oligonucleotide 
primers, N192 and C558 (shown in Figure 19) , have been used 
previously to clone fragments homologous to the region of Rxl 
pspA encoding amino acids 192-588 from various pneumococcal 
strains. These primers are modifications (altered restriction 
sites) of LSM4 and LSM2 which were previously shown to amplify 
DNA encoding the C-terminal 396 amino acids of PspA. Rxl (Figure 
17) ; this includes approximately 100 amino acids of the a-helical 
region, the proline rich region, and the C-terminal choline- 
binding repeat region. Using primers N192 and C558, a 1.2 kb 
fragment from strain EF6796 was amplified by PCR, and 
subsequently cloned in pET-9A (designated PRCT135) . This insert 
was then partially sequenced. 

Independently, a larger pspA fragment from strain 
EF6796 was made using primers LSM13 and SKH2 (shown in Figure 19) 
for the purpose of direct sequencing of serologically diverse 
pspA genes. 

The LSM13 and SKH2 primer pair result in the 
amplification of the 5' end of most pspA gene(s) encoding the 
upstream promoter, the leader peptide, the a-helical, and the 
proline-rich regions (amino acid -15 to 450) (Figure 20) . From 
the strain EF6796, the LSM13 and SKH2 primers amplified a 1.3 kb 
fragment (pspA*EF6796) , which was sequenced. The sequence from 
PRCT135 and the LSM13/SKH2 PCR-generated fragment pspA.EF6796 was 
not identical. The fragment obtained by PCR using primers LSM13 
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and SKH2 was designated pspA based on its location within the 
same chromosomal location as pspA.Rxl. The cloned fragment in 
pRCT135 was assumed to represent the sequence of the second gene 
locus, pspC, known to be present from Southern analysis. Both 
genes have significant similarity to the corresponding regions of 
the prototype pspA gene from strain Rxl. The second gene locus 
was called pspC, in recognition of its distinct chromosomal 
location, not sequence differences from the prototype pspA gene. 

Analysis of the nucleotide and amino acid sequence of pspC 
EF6796. To test the hypothesis that pRCT135 represented pspC of 
EF6796, and to further investigate pspC, the entire EF6796 pspC 
gene was cloned as a 3.4 kb Hindlll-EcoRl fragment forming 
PLXS200. DNA sequence of the pspC-containing clone pLXS200 
revealed an open reading fram of 2782 nucleotides based on the 
analysis of putative transcriptional and translation start and 
stop sites (Figure 21). The predicted open reading frame encodes 
a 105 kDa protein which has an estimated pi of 6.09. 

PspA. Rxl and PspC.EF6796 are similar in that they both 
contain an a-helical region followed by a proline-rich domain and 
repeat region (Figure 20). However, there are several features 
of the amino acid sequence of PspC which are quite distinct from 
PspA. From comparisons at the nucleotide as well as the 
predicted amino acid sequence, it is apparent that the region of 
strong homology between PspC and PspA begins at amino acid 458 of 
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PspC (amino acid 147 of PspA) and extends to the C-terminus of 
both proteins (positions 899 and 588 respectively) . The 
predicted amino acid sequence of PspC.EF6796 and PspA.Rxl are 76% 
similar and 68% identical based on GCG Bestf it program for this 
region (Figure 22) . The nucleotide sequence identity between 
pspC and pspA is 87% for the same region. Eight bases upstream 
of the ATG start site is putative ribosomal binding site, 
TAGAAGGA* The proposed transcriptional start -35 (TATACA) and - 
10 (TATAGT) regions are located between 258 to 263 and 280 to 
285, respectively (Figure 21) . A potential transcriptional 
terminator occurs at a stem loop between nucleotides 3237 through 
3287, The putative signal sequence of PspC is typical of other 
gram positive bacteria. This region consists of a charged region 
followed by a hydrophobic core of amino acids. A potential 
cleavage site of the signal peptide occurs at amino acid 37 
following the Val-His-Ala. The first amino acid of the mature 
protein is a Glu residue. 

Other than features similar to all signal sequences, 
there is no homology in this region between pspA and pspC. This 
confirms that pspC is present in a separate chromosomal locus 
from that of pspA. The signal sequence and upstream region have 
striking similarity to the similar regions of s. agalactiae 6 
antigen (accession number X59771) . The fcantigen of Group B 
streptococci is a cell surface receptor that binds IgA. 
Similarity to the jbac gene ends with the start of the mature 
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protein of PspC, and the nucleotides are 75% identical in this 
region. Thus, although pspC is in a very similar chromosomal 
locus to the 6 antigen, it is clearly a distinct protein. 

The N-terminus of PspC is quite different from the N- 
termimis of PspA. Prediction of the secondary structure 
utilizing Chou-Fausman analysis (Chao, P.Y. et al., Adv. Enzymol. 
Relat Areas Mol. Biol. 1978: 17: 45-148), suggests that the 
structure of amino acids 16 to 589 of PspC is predominately cr- 
helical. The Matcher program was used to examine periodicity in 
the a-helical region of PspA* The characteristic seven residue 
periodicity is maintained by having hydrophobic residues at the 
first and fourth positions (a and d) and hydrophobic residues at 
the remaining positions. The coiled-coil region of the a-helix 
of PspC (between amino acid 32 to 600) has three breaks in the 
heptad repeat (Figure 23) . These disturbances in the 7 residue 
periodicity occur at amino acids 99 to 104, 224 to 267 and 346 to 
350. The a-helical region of PspA has seven breaks in the motif, 
each break ranging from a few amino acids to 23 amino acids each* 
In contrast, the three breaks in the coiled-coil motif of PspC 
involve 5, 43 and 4 amino acids, respectively - 

The sequence encoding the a-helical region of PspC 
contains two direct repeats 483 nucleotides (160 amino acids) 
long which are 88% percent identical at the nucleotide level. 
These repeats, which occur between nucleotides 562 to 1045 and 
nucleotides 1312 to 1795, are conserved both at the nucleotide 
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and amino acid level (amino acids 188 to 348 and 438 to 598) 
(Figure 24) . PspA lacks evidence for any repeats this prominent 
within the cr-helical region. These repeat regions could provide 
a mechanism for recombination that could alter the N-terminal 
half of the PspC molecule . Although repeat motifs are common in 
bacterial surface proteins, a direct repeat this large or 
separated by a large spacer region is novel. The evolutionary 
significance of this region is not known. A Blast search of the 
repeat region and the 267 nucleotide bases between them revealed 
no sequence with significant homology at the nucleotide or amino 
acid level. However, one of the structural breaks in the coil- 
coiled region of PspC is the region between the two repeats. 
Perhaps some deviation from coiled-coil structure between the two 
repeats is critical to maintain the ce-helical structure. 

Previous studies have shown that a major cross- 
protective region of PspA comprises the C-terminal 1/3 of the a- 
helical region (between residues 192 and 260 of PspA.Rxl). This 
region accounts for the binding of 4 of 5 cross-protective 
immunity in mice. Homology between PspC and PspA begins at amino 
acid 148 of PspA, thus including the region from 192 - 299. The 
homology between PspA and the PspC includes the entire PspC 
sequence C-terminal of amino acid 486. Based on the fact that 
PspA and PspC are so similar in this region known to be 
protection-eliciting, PspC is also likely to be a protection- 
eliciting molecule. Because of close sequence and conformational 
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similarity of the proteins in this region, antibodies specific 
for the region of PspA between amino acid 148 and 299 should 
cross-react with PspC and thus afford protection by reacting with 
PspC and PspA* Likewise, immunization with the PspC would be 
expected to elicit antibodies cross-protective against PspA. The 
differences between PspC of strain EF6796 and PspA of strain Rxl 
is no greater than the differences between many additional PspAs, 
which have been shown to be highly cross-protective. 

A proline-rich domain exists between amino acid 590 to 
652. The sequence, PAPAPEK, is repeated six times in this 
region. This region is very similar to the proline-rich region 
of PspA. Rxl which contains the sequence PAPAP repeated eight 
times in two proline-rich regions. These two regions of PspA. Rxl 
are separated by 27 charge amino acids; no such spacer region is 
present in PspC. 

Many cell surface proteins of other gram positive 
bacteria contain proline-rich regions. These are often 
associated with a domain of protein that is predicted to be near 
the cell wall murein layer when the protein is cell-associated. 
For example, in M proteins of S. pyogenes this domain contains 
both a Pro- and Gly-rich regions. The f ibronectin-binding 

* 

protein of S. pyogenes, S. dysgalactiae , and Staphylococcus 
aureus contains a proline-rich region with a three-residue 
periodicity (pro-charged-uncharged) that is not found in PspA or 
PspC. An M-like protein of S. equi contains a proline-rich 
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region that is comprised of the tetrapeptide PEPK- This region 
lacks glycine normally found in the proline regions of M- 
proteins. The last proline repeat region of this molecule is 
PAPAK, which is more similar to the proline-region of PspA and 
PspC than it is to M- proteins. 

Proline-rich regions of gram positive bacterial 
proteins have been reported previously to transit the cell wall. 
The differences in proline-rich regions of proteins from diverse 
bacteria may reflect differences in protein function or possibly 
subtle differences in cell wall function. Proline-rich regions 
are thought to be responsible for aberrant migration of these 
proteins through SDS-polyacrylamide gels. 

The repeat region of PspC is a common motif found among 
several proteins in gram positive organisms. Autolysin of S, 
pneumoniae, toxins A and B of Clostridium difficile , 
glucosy 1 trans f erases from S. downei and S. mutans , and CspA of C. 
acetobiltylicum all contain similar regions. In PspA these 
repeats are responsible for binding to the phosphatidylcholine of 

* 

teichoic acid and lipoteichoic acid in cell wall of pneumococci. 
However, bacterial proteins containing C-terminal repeats are 
secreted, which may imply either a lost or gained function. 
Although all of these proteins have similar repeat regions the 
similarity of the repeat regions of PspA and PspC is much greater 
than that of PspC to the other proteins (Table 66) . 
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Interestingly, PspC like PspA has a 17 amino acid 
partially hydrophobic tail. The function of this 17 amino acid 
region is unknown. In the case of PspA it has been shown that 
mutants lacking the tail bind the surface of pneumococci as well 
as PspAs in which the tail is expressed. Presently, it is now 
known whether PspC is attached to the cell surface or secreted. 

PspA and PspC proteins both have a-helical coiled-coil 
regions, proline-rich central regions, repeat regions, with a 
choline binding motifs, and the C-terminal 17 amino acid tail. 
PspA and PspC share three regions of high sequence identity. One 
of these is a protection-eliciting region present within the 
a-helical domain. The other two regions are the proline-rich 
domain and a repeat domain shared with other choline binding 
proteins and thought to play a role in cell surface association. 
The similarity throughout most of the structure of the PspA and 
the PspC molecules raises the possibility that the two molecules 
may play at least slightly redundant functions. However, the 
fact that the N-terminal half of the protein is not homologous to 
any of the a-helical sequence of PspA suggests the PspC and PspA 
may have evolved for at least somewhat different roles on the 
cell surface. One of the most striking differences between the 
two molecules is the single repeat in the a-helical region of 
PspC Although neither the exact function of PspA nor of PspC 
are known, the observation that a major cross-protective region 
of PspA is highly homologous with a similar region of PspC, 
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raises the possibility that both molecules are protection- 
eliciting and elicit cross-protective antibodies. 

The sequence similarity between the promoter region of 
the pspC gene and the bac gene from group B streptococci is very 
intriguing. It implies that an interspecies recombination event 
has occurred and, this interspecies recombination has contributed 
to the evolution of the pspC. The pspC gene thus has a chimeric 
structure, being partially like pspA and partially like the 6 
antigen. In the latter case, all protein similarity is limited 
to the signal sequence. Similar interspecies recombination 
events have contributed to the evolution of the genes encoding 
penicillin binding protein. 

Using analogous procedures, a second PspC sequence was 
isolated from strain D39 of S* pneumoniae. Figures 25 to 29 show 
the sequence data of PspC from strain D39, complete from upstream 
of the promoter through the proline-rich region. Strain D39 has 
the same genetic background as strains Rxl, from which pspA was 
sequenced. D39 and Rxl have the same pspC gene based on Southern 
blot analysis. 

The alpha-helical encoding region of the D39 pspC gene 
is one third of the size of the homologous region from the EF6796 
pspC gene. The proline-rich region of the D39 pspC gene was more 
similar to Rxl pspA than to EF6796 pspC* Even so, the two pspC 
genes were 86% identical at the nucleotide sequence, and 67% 
identical at the amino acid level. 
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In the alpha-helical sequence of EF6797 pspC a strong 
repeat was observed. This was absent in the pspC sequence of 
D39, The D39 pspC sequence also lacks a leader sequence, found 
in the EF6797 pspC sequence. 

This data strongly indicates that there is variability 
in the structure of pspC, similar to previous observations for 
pspA. In the case of pspC, however, the extent of variability 
appears to be even greater than that which has been observed for 
pspA* 
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Table 66. 



PERCENT HOMOLOGY OF 


CHOLINE BINDING 


REGIONS 1 


Percent 
similarity/identity 


Protein 


Organism 


PspA 


PspC 


PSpC 


5. pneumoniae 


86/60 


100/100 


Bacteriophage Cp- 
1 


S. pneumoniae 


56/30 


56/28 


LytA 


S. pneumoniae 


57/33 


61/32 


P&pA 


C. perfrlngens 


64/45 


59/42 


alpha toxin 


C novyl 


54/29 


57/33 


CspB 


C. 

ace tobuty Ileum 


58/36 


61/45 



Having thus described in detail certain preferred 
embodiments of the present invention, it is to be understood that 
the invention defined by the appended claims is not to be limited 
by particular details set forth in the above description, as many 
apparent variations thereof are possible without departing from 
the spirit or scope thereof. 
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WflAT IS CLAIMED 18 ; 

1. An isolated amino acid molecule consisting of 
residues 1 to 115, 1 to 260, 192 to 588, 192 to 299, or residues 
192 to 260 of pneumococcal surface protein A of Streptococcus 
pneumoniae* 

2. An isolated DNA molecule consisting of a fragment 
of pneumococcal surface protein A gene of Streptococcus pneumonia 
encoding the isolated amino acid molecule of claim 1* 

3* A PCR primer consisting essentially of the isolated 
DNA molecule of claim 2. 

4. A hybridization probe consisting essentially of the 
isolated DNA molecule of claim 2. 

5. An immunological composition comprising the amino 
acid molecule of claim 1. 

6. An isolated DNA molecule consisting of nucleotides 
1 to 26, 1967 to 1990, 161 to 187, 1093 to 1117 or 1312 to 1331, 
or 1333 to 1355 of a pneumococcal surface protein A gene of 
Streptococcus pneumoniae ♦ 

7* A PCR primer consisting essentially of the isolated 
DNA molecule of claim 6. 

8. A hybridization probe consisting essentially of the 
isolated DNA molecule of claim 6. 

9. An isolated DNA molecule consisting of a fragment 
of a pneumococcal surface protein A gene of Steptococcus 
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pneumoniae consisting of a nucleotide sequence (5' to 3') 
selected from 

CCGGATCCAGCTCCTGCACCAAAAAC ," 
GCGCGTCGACGGCTTAAACCCATTCACCATTGG ; 
CCGGATCCTGAGCCAGAGCAGTTGGCTG ; 
CCGGATCCGCTCAAAGAG ATTGATGAGTCTG ; 
GCGGATCCCGTAGCCAGTCAGTCTAAAGCTG ; 
CTGAGTCGACTGGAGTTTCTGGAGCTGGAGC ; 
CCGGATCCAGCTCCAGCTCCAGAAACTCCAG ; 
GCGGATCCTTGACCAATATTTACGGAGGAGGC ; 
GTTTTTGGTGCAGGAGCTGG ; 
GCTATGGGCTACAGGTTG ; 
CCACCTGTAGCCATAGC ; 

CCGCATCCAGCGTGCCTATCTTAGGGGCTGGTT ; and 
GCAAGCTTATGATATAGAAATTTGTAAC . 

10. A PCR primer consisting essentially of at least 
one isolated DNA molecule of claim 9. 

11. A hybridization probe consisting essentially of at 
least one isolated DNA molecule of claim 9. 

12. PCR probe (s) which distinguishes between pspA and 
pspA-like nucleotide sequences. 

13. PCR probe (s) which hybridizes to both pspA and 
pspA-like nucleotide sequences. 

14. A PspA extract prepared by a process comprising 
growing pneumococci in a first medium containing 

choline chloride, 

eluting live pneumococci with a choline chloride 
containing salt solution, and 

growing the pneumococci in a second medium containing 
an alkanolamine and substantially no choline. 

15. A PspA extract prepared by 
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growing pneuroococci in a first medium containing 
choline chloride, 

eluting live pneumococci with a choline chloride 
containing salt solution, 

growing the pneumococci in a second medium containing 
an alkanolamine and substantially no choline, and purifying PspA 
by isolation on a choline-Sepharose affinity column. 

16. An immunological composition comprising the 
extract of claim 14. 

17. An immunological composition comprising the 
extract of claim 15. 

18. An immunological composition comprising full 
length PspA. 

19. A method for enhancing immunogenicity of a PspA- 
containing immunological composition comprising including in said 
composition the C- terminal portion of PspA. 

20. An immunological composition comprising at least 

two PspAs. 

21. The immunological composition of claim 20 wherein 
the PspAs are from different groups based on RFLP. 

22. PCR amplification product from a primer as claimed 
in claims 3, 7, 10, 12 or 13. 

23. An isolated DNA molecule consisting of a 
nucleotide sequence homologous to a portion of pspA. 
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24. An isolated amino acid molecule comprising 
pneumococcal surface protein C, PspC, of Streptococcus pneumoniae 
having alpha-helical, proline rich and repeat regions. 

25* An isolated DNA molecule comprising a pneumococcal 
surface protein C gene of S* pneumoniae encoding the isolated 
amino acid molecule of claim 24. 

26. A PCR primer consisting essentially of the 
isolated DNA molecule of claim 25. 

27. A hybridization probe consisting essentially of 
the isolated DNA molecule of claim 25. 

28. An immunological composition comprising the amino 
acid molecule of claim 24. 

29. An isolated amino acid molecule of claim 24 having 
strong homology with pneumococcal surface protein A, PspA, of S, 
pneumoniae from amino acid 458 of PspC, corresponding to amino 
acid 147 of PspA, extending to a c-terminus of PspC and PspA. 

30. An isolated amino acid molecule of claim 24, 
further comprising a signal sequence consisting essentially of a 
charged region followed by a hydrophobic core of amino acids. 

31. An isolated amino acid molecule of claim 24, 
wherein the alpha-helical region further comprises a seven 
residue periodicity and a coiled coil region having three breaks 
in a heptad repeat. 

32. An isolated amino acid molecule comprising 
pneumococcal surface protein C, PspC, of S. pneumoniae having 
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alpha-helical r proline rich and repeat regions, wherein the 
alpha-helical region comprises a C-terminus having substantial 
homology with a protection-eliciting region of PspA. 

33. An isolated DNA molecule comprising a pneumococcal 
surface protein C gene of S. pneumoniae encoding the isolated 
amino acid molecule of claim 32. 

34 . A PCR primer consisting essentially of the 
isolated DNA molecule of claim 33. 

35. A hybridization probe consisting essentially of 
the isolated DNA molecule of claim 33. 

36. An immunological composition comprising the amino 
acid molecule of claim 32. 

37. An isolated amino acid molecule of claim 24, 
further comprising a 17 amino acid, partially hydrophobic tail. 

38. An isolated amino acid molecule of claim 32, 
further comprising a 17 amino acid, partially hydrophobic tail. 

39. An isolated amino acid molecule of claim 24, 
further comprising an epitope of interest. 

40. An isolated amino acid molecule of claim 32, 
further comprising an epitope of interest* 

41. An immunological composition comprising the amino 
acid molecule of claim 39. 

42. An immunological composition comprising the amino 
acid molecule of claim 40. 
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AMINO ACID SEQUENCES IN THE NH2 -TERMINAL END OF DIFFERENT PSPA GENES . Gap 

inserted to maximize alignment with related PspA sequences). - 

ACCCG303 MNKKK2HLT5 IASVAXLGTG rVASPFTLVR AEESPQWEK SSLEKKYEEA 

KAKADTAKKD YETAKK. . .K AEDAQKKYDE DQKKTEDKAK A.VKKVDEER 
QKAILAVQKA YVEY, . . .RE AXDKASAEKQ IAEAKRKT 



Ac94. . . MNKKKMILTS LASVAILGAG LVTAQPTLVR AEEAP.VASQ SKAEKDYDTA 

KRDAENAKKA LEEAKR AQXKYED DQKKTEEKAK E.EKQASEAE 

QKANLQYQLK LREYIQ, -KT GDRSKIQTEM EEAEKKHKTA KAEFDKVRGT 
VIPSAARV 

Bgll703pro MJKKKMILTS LASVAILGAG LVTSQPTLVR AEEAP.VASQ SKAEKDYDAA 

VKKSEAAKKA YEEAKK. . .K AEDAQKKYDE DQKKTEEKA. ENEKKAAADL 
TEATEVHQKA YVRYSGSNEQ KJKNFKILAI 



MXKKKHILTS IASVAILGAG tXVASQPTXVR AEDAP.VANQ SQAEKDYXAA 
XXKSEAAKKX YXXAKKVLAE AEAAQKXXED XQKKPEEKA. EKAKAASEEI 
VKATSEVQXA A • ♦ . / 



MNKKKMILTS LASVAILGAG LVTSQPTLVR AEEAP.GASQ SKAEKDYXAA 
XKKSEAAKKA YEEAKK. * .K AEDAQKKYDE GQKKTEEKA. RKAEEASKEL 
AKATSEVQNA YVKYQGVQRN SRLNEKERKK QLAEIDEEIN KAKQIWNEKN 
EDFKKVREEV IPEPTELAKD QRKAEEAKAE EKVAKRKYDY ATLKVALAK5 
YVEAEEAXL* 

MNKKKMILTS LASVAILGAG LVTSQPTFVR AEEAP.VASQ PKAEKDYDPA 
GKKSEAATKA YEDAKP. . .T AEDAQKKYDE AQKKPDAER 



Bg7322pro 



Bg7561pro 



Bg8090pro 



Bg8743pro MNKKKMILTS LASVAILGAG LVASQPTWR AEEAP.VAKQ SQAERDYDAA 

MKKSEAAKKE YEEAKKDLEE AKAAQKKYGG DPKKTGEETK LVPK.ADGER 
PKANVAVPKA YLKLREAQEQ LNQSPNNKKN SAQQKLKDAL AHIDEVTLNQ 
KEAEA. 



Bg8838pro MNKKKMILTS LASVAILGAG LVTSQPTWR AEESP.VASQ SKAEKDYDAA 

VKNATAAKKA AEDAHRALDE AKAAQKNYDE DQKKPEEKAK EVPKAFAEE. 
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Bg9163pro m^kkk^ts lasvailgag lvasqptlvr aedap/vanq sqaekdydaa 

MKKSEAAKKE YEDAKKVLAE AEAAQKKYED DQKKTEEKA. ENANAASEEI 
AKATEEVH : » 



Bg9739pro MNKKKMILTS LASVAILGAG LVASSPTWR AEEAP.VASQ SKAEKDYDTA 

KRDAEftZAKKA I.KRAKR AQEKYAD YQRRIEEKAA K.ETQASLEQ 

QEANKDYQLK LKKYLDGRNL SNSSVLKKEM EEAEKKDKEN QAEFNKIRRE 
IWPNPQELE MAR RKSEWX ATESOLVTRV EEAEKNVTDA RQKLVLHCNE 
WLQAXXAEL ESGGHKLEPK 

Dbllpro MNKKKMILTS LASXAXLGAG LVASQFTWR AEEAP.VASQ SKAEKDYDAA 

KRDAENAKKA LEEAKR AQKXXED DQKKTBEKAK X.DXQASEAE 

QKANLXYQLL LQKYVSESDG KKKKEXEXXA DAAKKEIELK XADLXKIXQE 



DblSpro MNKKKMILTS LASVAILGAG LVASQPTWR AEEAP.VASQ SKAEKDYDAA 

VEKSKAAEED LB E AEAAQRKYDE DQKKSEENEK E.TEEASERQ 

QAATLKYHLE SXEFLNYFQD NHR 



DblSaapro 



Dfol6apro 



E£10197pro 



Ef3296pro 



Ef6796pro 



L819D5pro 



Rxlpro 



MNKKKMILTS 
KRDAENAKKA 
QEANKDYQLK 
IWPNPQELE 
EWLQAQIA. 

MNKKKMILTS 
KRDAENAKKA 
QEANKDYQLK 



LASVAILGAG LVASPPTWR AEEAP.VASQ SKAEKDYDTA 

LEBAKR AQEKYAD YQ RRIEE KAA K.ETHASLEQ 

LKKYLDGRNL SNSSVLKKEM EEAEKKDKEK PAEENKIRRE 
MARRKSEVAK TKESGLVKRV EEAEKKVTEA RPKLDAERAK 



LASVAILGAG LVASPPTWR AEEAP . VASQ SKAEKDYDTA 

LEEAKR AQEKYAD YQRRIEEKAA K.ETHASLEQ 

LKKYLDGRNL SNSSVLKKEM EEAEKKDKEK QAGL 



1 



MNKKKMILTS 

KRDAENAKKA 
QEANKDYQLK 
IWPNPQELE 
EWLQPTR+V 
MNKKKMILTS 
KAKADTAKKD 
XDVALWQNA 
EQQDLQNNFN 



LASVAILGAG 



LKKYLDGRNL 
MARRKSEWK 
ENEVHKLXQK 
LASVAILGAG 
YETAKK. . »K 
YK35Y* * » »KE 
EVRAWAPDP 



LVTSQPTLVR AEESP.VASQ SKAEKDYDAA 
• . * AQEKYAD YQR RIEE KAA K.EQQASLEQ 
SNSSVLKKEM EEAEKKDKEK QAEFNKIRRE 
AKE6GLVKRV EEAEKKVTEA RQKLDAERAK 

LVTSQPTFVR AEESPQWEK SSLEKKYEEA 
AEDAQKKYED DQKRTEEKAR K.EAEASQKL 
VQNQRSKYKS DADYQKKLTE VDSKIEKARK 
TCVGXDXR 



MNKKKMILTS LASVAILGAG XVTSQPTXVR AEEAPQWEK SSLEKKYEEA 
KAKYDAAXKD YDEAKK. . .K AAEAQKKYEE DQKKTEEKAE K.AKAASEEI 
AKATEEVQKA VLDYITAIRN HNDSGKTSAE EAENKAKERD YCCAGKKFDP 
IQTPFVASLT QMXL # 

MNKKKMH/TS LASVAILGAG LVASSPTWR AEEAP.VASQ SKAEKDYDTA 

KRDAENAKKA LEEAKR .AQEKYAD YQRRIEEKAA K.ETQASLEQ 

QEANKDYQLK LKKYLDGRNL SNSSVLKKEM EEAEKKDKEN QAEFNKIRRE 
IWPNPQELE MA . * # . % 

MNKKKMILTS LASVAILGAG FVASQPTWR AEESP.VASQ SKAEKDYDAA 
KKDAKNAKKA VEDAQKALDD AKAAQKKYDE DQKKTEEKA. ALEKAASEEM 
DKAVAAVQQA YLAYQQATDK AAKDAADKMI DEAKKREEEA KTKFNTVRAM 
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Wu2pro 



Ef5668pro 



Bg6692pro 



WFEFEQLJU5 

DA 

MNKKKMILTS 
VKKSEAAKKA 
AKATEEVQQA 
WPEPEQLAE 

da 

MNKKKMILTS 
VKKSEAAKKD 

AEATKEVQQA 
QLYFLNQVSY 



TKKKSEEAKQ KAPELTKKLE EAKAKLEEAE KKATEAKQKV 



LASVAILGAG 
YEEAKKALEE 
YLAYQRASNK 
TKKKAEEAKA 



LASVAILGAG 
YETAKK, . .K 
YLAYLGASNE 
LRLRKXQKRQ 



LVASQPTLVR AEESP.VASQ SKAEKDYDAA 
AKVAQKKYED DQXKTEEKA. ELEKEASEAI 
A. ♦EAMQUE EAQRMNEAR AKFTTIRTTM 
KERKLAKKAA EAKAKLEEAE KKATEANPQV 

FVASSPTFVR AEEAP-VANQ SKAEKDYDAA 
AEDAQKKYDE DQKKTEAKAE K.ERKASEKI 
SQKKEADKKI KEATHAKMRR TCNLTIEFEQ 
QKKQKYLRKN LKRQLKRYKY RKIKYLNKML 



MNKKKLIVTS LASVAUGAD SVT5PPALVR ADEASLIASQ SKAEKDYDAA 
KKDAXNAKKA VEDAQKALDD AKAAQKKYDE DQKKTEKKAA AV.KKIDEEH 
QAANLKSQQA LVEFLAAQRE GNPKKKKAAQ ATLEEAENAE KETK 



Acl22pro MNKKKMIKTS LASAAIFGAX SETSQPTRVR PVEAPE.&RH PKVDKYYDAE 

ADEY - 

•••••»»■•• ••*•••••*« ■««*•••••« 



A66pro MNKKKMILTS LASVAILGAG FGCVSAYSCK SKRISRS*SA *SSQRL 



L82013pro MNKKKMILKS LASAAISGAX LVXPQPTLVR AEESP.AASQ SHPEQDYDXX 

XXLCXXLXHQ PSXGKPLLXX XXSXPXSPTP XXXXXXPXSX LTXLXPLXXX 
LKPFPLPXSX PXPPXPPXSP PSPPPRPXLY XXPPXPXPXL SLXLIPFLLL 
XLPPPXXXLP HLXSPPXPXL 1 PPSPTPX 

\ ; 
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SEQUENCES IN THE CENTRAL REGION - { Includes CArboxy- terminus of alpha -heli 

and region and some of the proline-rich region. Gaps are inserted to 
maximize alignment related PSpA sequences.) 
30 336 

0922134c 

.L KEIDESDSED YLKEGLRAPL QSKLDTKKAK LSKLEELSDK 

IDELDAEIAK LEVQLKDAEG KNNVE A.YFKEGLEK TTAEKKAELE 

KAEADLKKAV DEPEXPAPA PQPA PAPEKPAE K 

PAPAPAP... EKPAPAPE .K.PAEK PAEKPAEEPA EKPAPAPEK. 

.PAPTPE .KPAPTPETP KTGWKQENGM 



Atcc6303c 



Ac 12 2c 



A66c 



V LDXTIAEGKA GIAAXPPNID 

XT PKDti EDSGLGLEKV LATLDPGGET PDGLDKEASE DSNIGALPNQ 

VSDLENQVSE LDREVTRLPS DLKDTEGNNV GUYVKGGLEK ALTDEKVGLN 

NTPKALDTAP KALDTALNEL G.PDGDEEET PAPAPKPE . . . .QPA 

EQP K PAPAPK PEKTODQQAE EDYARRSEEE 

YNRLPQQQPP KAEK. .PAPA PKPEQPVPAP 



GGW SWR*ILLARP 

DRXiAARQAEL AQKQTELGKL LDSLDPEGKT QDKLDKEAGE . . . .AELDKK 

ADGLPNKVSD LEKEISNLEI LLGGADSEDD T AALPN KLATKKAELE 

KTQKELDAAL NELG PD6DSEET PAPAPQPE ♦ . * .Q 

PAPAPKPEQ PTPAPK PEQPTPAPKP EQ. .PAP. . . 

AP KPEQ. .PAPA PKFEQPAPAP KP,EQPTPGP KIB. 



**••*«•« 



Ac94c 



Bg8090c 



Bg8743c 



E LLLLEKAGLG 

KAGADLKEAV NEPGESAGEP SQPEEPAEEA PAPEQPTEPT 

QPEEP AGETPAPKPE K. . .PAGQPK AEKTDDQQAE EDYARR 

VNRLTQQQPP KAEXPAPA. ! PQPEQPAPAP K - 



ci si oi 



«L KEIDESDSED YVKEGLRVPL QSELDVKQAK LLKLEELSDK 

IDELDAEIAK NLKKDVEDFQ NSGGGYS, - . .ALYLEAAEK DLVAKKAELE 

KTEADLKKAV NEPEKPAEE PENPAP APK 

PAPAPQP... . ..EKPA PAPAPK PEKSADQQAE EDYARRSEEB 

YNRLTQQQPP KAEKPAPAPV PKPEQPAPAP KSR 

VXLDRGPAEA AVKEQVDSPP QQIiAD*VKEI STHGKFLGGA ATEDETSALP 

NKITAKQAEL AXXQTELEKL LDNLDPEGKT QDELDKKAAE AELDKK 

ADELPNKVAD LEKEISNLEI LLGGADPEDD T AALPN KLATKKAEFE 

KTPKELDAAL KGLG . .PDGDEEET PA 

PAPAPKPEQ PAPA PAPKP EQPAPAP. . . 

►AP KPEQPAPAPA PKPEQPTOAP K 



L KEIDESDSED YIKEGLRAPL QSKLDAKKAX LSKLDELSDK 

IDELDAEIAK LEKDVGDFPN SDGEQ AGQYLVAAEK DLDAKEAELG 

RTCADLKKAV DEPETPAPA PAPK PAPAPAPT P 

EAPAPA* . . ♦ FKPAFAPK PAPAPK PAFAFKPAPA PKPAPAFK. . 

. PAPAPKPE RT 



Bg9163c 



... 



END 
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GVQRTRKRAP KRIMSLSQKV XLKXVCRAPL QSKLDAQKAE LLKLEELSGK 

IEELDAEIAE LEVQLKDAEG NNNVE A.WKEGLEK TTAEKKAELE 

XAXADLKKAV DEPETPAPA. . . , . . .PAPA PAP A 

PAPAPA. . . - PAPAPAPK PAPAPK PAPAPAPAPA PKPAPAPK. . 

PAPAPAPA PKPEKPAEKP APAPKPETXK TOG 



Bg9739c 




L KEIDESDSED YVKEGFRAPL QSELDAKQAK Li 

IDELDAEIAK LEKDVEDPQN SDGEQ AGQYLAAAGE DL! 

KAEADLKKAV DEPETPAPA PA* . PAPAPAPT P 

EAPAPAPAPA PKPAPAPK PAPAPK PAPAPKPAPA PKPAPAPK. . 

PAPAPAPA PKPEKPAEKP APAPKFE. . . . . 



EflOISc 



Ef 3296c 



E£6796xc 



L KEIDESDSED YVKEGFRAPL QSELDAKQAK IiSKLEELSDK 

IDELDAEIAK LEDQLKAAEE NNNVE DYFKEGLEK TIAAKKAELE 

KTEADLKKAV NEPEKPAEEP SQPEKPAEBA PAPEQPTEPT QPEKPAEQPQ 
PAPAPQPEKP AEETPAPKPE K. /.PAEQPK AEXPADQQAE EDYARRSEEE 
YNRLTQQQPP KAEKPAPA. , PKTK. . . - , 



GGS 

NELAAXQAEL 
ADELPNKVAD 
KTQKELDAAP 
PAPAPKPEQ . 
AP 



ALDQEAAAPP HQVADLEKQI 
AQKPTGLEKL LDSLDPGGKT 
LEKEISNLEI LLGGADSEDD 

NELG PDGDEEBT 

PAPAPK 

KPEQ..PAPA PKPEQPAKPE 



TGPEIFLGGA DPEADIAARP 

QDELDKEASE . AELDKK 

T. . . .AALEN KLAXKXAELE 

PAPAPQPE .Q 

PEQPAPAPKP EQ. .PAP. 
KPAEEPTQPE KPATPKT. 



fiblSc 



L81905C 



RCCl 15C 



VRAL. .KVAE FGVQLRDAG6 SNNVG A.YFKEGLEE TTAEXEAGLG 

KAEADLKKAV DEPET PAP A 

PAPAPA PAPAPAPK PAPAPK PAPAPAPAPA PKPAPAPK. . 

PAPAPAPA PKPEKPAEKP APAPKPETPK T. 



L KDIDESDSED YAKEGLRAPL QSELDTKKAK LLKLEELSGK 

IEELDAEIXE LEVQLKDAEG NNNVE A.YFKEGLEK TTAEKKAELE 

KAEADLKKAV DEPETPAPA PAPA PAPAPTPE A 

PAPAPA PKPAPAPK PAPAPK PAPAPKPAPA PKPAPAPKPA 

PAPAPAPAPK PAPAPAPAPA PKPEKPAEKP APAPKPETPK TGWKQENGM. 



♦ . -L KEIDESDSED YVKEGFRAPL QSELDAKQAK LSKLEEXSDK 

XDETiDAETAK LEKDVEDFKN SDGEQ AGQYLAAAEE DLIAKKAXLE 

KAEADLKKAV DEPETPAPA PA. • PAPAPAPT P 

EAPAPA PAPAPK PAPAPK PAPAPKPAPA PKPAPAPK.. 

. . . PAPAPAPA PKPEKPAA . 



# . • * 



* • 



# * 



IiKEIDESDVE VKKMXELVK EEAKEPRNES KVKQAKAEVE 

SKKAEATRLE KIKTDRKKAE EAKRKAAEED KVKEK 

♦ .PAPKPEN PAEQPK AEKPADQQAE EDYARRSEEE 

YXRIiTQQQPP KTEKPAQPST PKT , , 
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Rctl21c 



RCtl23c 



Kctl29c 



Rctl3 5c 



. . . . « K GEARESRXEE KVNQPKXEVE 

SKKXEATRIiE KIKTDRKKAE EAXEKAAEED KVKEKFAEQP QFAPAFQFEK 

PAPAPKPEN PAEQPK AEKPADQQAE EDYAHRSEBE 

TORI/PQQQPP KTEKPAQPST XK 1 



I KEX DE SXSED YLKEGLRAPL QS KLDT KKAK LSKLEELSDK 

IDELDftEIAK LEVQLKDAEG NNNVB A.YFKEOLEK TTAEKKAELE 

KAEADLKKAV DEPETPAPA PQPA PAPEKPAE K 

PAPAPAP. EKPAPAPEK. 

PAPTPE. *KPAPTPETP KTCWKQENGM WYFYKTDGSM 

ATGWLQNNGS WYYLNSNGAM ATGWHQNNGS WYYLNS 



L KEIDESDSED YLKEGLRAPIi QSKLDTKKAK LSKLEELSDK 

IDELDAEIAK LiEVQIiKDAEG NNNVE A,YFKEGLEK TTAEKXAELE 

KAEADLKKAV DEPDTPAPA PQPA PAPEKPAE *.TC 

PAPAPAP... EKPAPAPB .K.PAPA PEKPAP . . AP EKPAPAPEK. 

PAPAPE .KPAPAFEKP APAPKPETPE TRLETRKRY. 



L KEIDESDSED YLKEGLRAPIi QSKLDTKKAK IiSKLEELSDK 

IDELDAEIAK LEVQLKDAEG NNNVE A.YFKEGLEK TTAEKKAELE 

KAEADLKKAV DEPETPAPA PQPA PAPEKPAE/* ........ # K 

. .PAPAP.,, EKPAPAPB K.PAPA P EKPAPAPEK. 

PAPAPE .KPAPTPETP KTGWKQENGM 
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Bg8838c 
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L820131C 



Bgll703c 
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L KEXDESESED YAKEGFRAPL QSKLDRICKAK LSKLEELSDK 

HJELDA3SIAK LEDQLKAAEE NKNVE DYFKEGLEX TIAAXKAELE 

KTEADLKKAV NEPEKPA PAPET PAPEAPAE QPK 

PAPAPQP AFAPKFE K. . .PAEQPK PEKTDDQQAE EOTARHSEEE 

YNRI/TQQQPP KAEKPAPA. . PKTGWKQENG MWYFYNTDGS M 



GBQA. . . »GQYRAAAEG DLAAXQAELE 

KTEADLKKAV NEPEK. .PA PAPET PAPEAPAE QPK 

PAPAPQP APAPKPE K. . .PAEQPK AEKTDDQQAE EDYARRSEEE 

YNRLTQQQPP KAEKPAPA. . PKPEQPAPA 



PK NSKGBQA. , . .EQYRSAAGG DLAAKQVBLE 

KTEADLKKAV NEPEK. .PA PAPET PAPEAPAE QPK 

PAPAPQP APAPKPE K. . ♦ PAEQPK AEKPADQQAE EDYDRRSEEE 

YNRLTQQQPP KAEKPAPA. . PQPBQPAPAP KS 



L KEIDESDSED YVKEGFRAPL QSELDAKQAK LSKLEELSDK 

IDELHAEIAK . LBKDVEDFK XSDGBQA. , . .GQY1AAAEE PLXAKKABLK 

QTEADLKKAV NEPGKPAPA PAPET PAPEAPAE QPK 

PAPET. P APAPKPE K. « .PAEQPK PEKPADQQAE EDYARRSEEB 

YNRLTQQQPA PAQKPEQP • . AKPEKPAEEP TQPEK 



. • . .DAEIAK .LEKNVEYF& KTDAEQT. . . .EQYLAAAEK BLADKKAELE 

KTEADLKKAV NEPEKPAEE TPAPA PKPEQPAE. . .......QPK 

PAPAPQP APAPKP EKTDDQQAE EDYARRSEEE 

YNRLPQQQPP KAEKPAPA. . PKPEQPVP 



. . # # * *.«••*•• .A EXPENPAP. - ...... .APX 

PAXAPQPLKP EEPAEQPKPE KPEEPAGQPE PEKPDDQQAG EDYARRSGGE 
VNRFPQQQPP KAEKPAPA. . PKPEQPVPAP XT 



LLKKA klagakskaa 

TKKAELEPEL EKAEAELENL LSTLDPEGKT QDELDKEAAE . . . .AELNKK 
VEALPNQVSE LEBELSKLED NLKDAETONV EDXTKEGLEE AIATKQABLE 

KT P KELDAALNBL G.PDGDEEET PPPEAPAE .QPK 

PEK. PAEET. . . . . . PAPAPK PEKSADQQAE EDYARRSBEE 

YNRLTQQQPP KAEKPAPAPA PXFEQPAPAP KSR 



GLATKKKL NLAEARIELL 

LKKLGLEPGL EXftaAGLONL LSTLDPEGKT QDELDKEAAE . * • .AELNKK 
VEALPNQVAE LEEELSKLED NLKDAETNHV EDYUCEGLEE AIATKQABLE 

KT P KELDAALNEL G-PDGDEEET PAPEAPAE. • . * QPK 

PEK.PAEET .PAPAPK PEKSADQQAE EDYARRSEEE 
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Bg7561c 
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YNRDTQQQPP KAEKPAPAPA PKPEQPAPAP K 



KXQKVNLENL LSTLDPGGKT QDELDKGAAE AELMKK 

VEALPNPVXE LEEELSPPED MJCDAETNHV EDYIKEGLEE AIATKQAELE 

ET P QEVDAALNDL V.PDGGEEET PAPAPQPD EPA 

PAPAPNAEQ. PAPAPK PE3CSADQQAE EDYARRSEGE 

YNRLTQQQPP KAEKPAPAPA PKPEQPAPAP N 



KEHAK IiQSDLKDAEE KNVEDYXKEG LEQAITNKKA ELATTQQNID 

KT* . . .QKDL EDftRT,Kr,FKV LATLDPEGKT QDELDKEAAE AELNKK 

VEALQNQVAE LEEELSKLED NLKDAETHNV EDYIKEGLEE AIATKKAELE 

KT Q KELDAALNEL G.PDGDEEET PAPAPQPE. KPA 

EEP EN PAPAPK PEKSADQQAE EDYARRSEEE 

YNRLTQQQPP KAEK..PAPA PQPEQPAPAP KIE 



Wu2c 



• • • * * # 



L KSIDESESED YAKEGFRAPL HSKLDAKKAK LSKLKELSDK 

IDELDAEIAX LEDQLKAVEE NNNVE DYSTEGLEK TIAAKKTELE 

KTEAD&KKAV NEPEKSAEEP SQPEKPAEEA PAPEQPTEPT - 

QPEKP AEETPAPKPE K...PAEQPN AEKTDDQQAE EDYARRSEEE 

YNRLTQQQPP KAEKPAPA. . PQPEQTSSLH 



f/£ 13 (fLet(-8) 
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complete ••auenco for m&tzp&n 

Seauenco Sanaa: 1 to 1453 ' 



Sequence Songs 

10 20 30 40 50 60 70 

TTCACAAATX TTTACGGAGG AGGCTTXTGC TTAAXATAAG TATAQGCTAA AAATGKTTAT OY3AAAAGAG 

80 90 10D 110 120 130 

* • * * • » 

OTAAATTTAG ATG AAT AAG AAA AAA ATS AIT TEA ACA AGC CTA GCC AGC GTC OCT ATC TTX OOO 
XNXXKHXLT5LASVAIL G> 
140 ISO 160 170 100 190 

* • * * • ; 

OCT GGT TTT QTT GCG TCT ICQ CCT ACT TTX OTA AGA OCA GAA GAA OCT OCT CTA OCT AAC 
AGFVAS3PTrVRABBAPVA»> 

200 310 320 230 240 2f0 

* * * * * * 

GAG TCT AAA GCT GAG AAA GAC TAT GAT OCA CCA OTG AAA AAA TCT GAA OCT CCT AAG AAA 
Q5KA£KDYDAAVKX6EAAKX> 

«0 270 300 290 300 310 

* * * * * * 

CAT TAC CAA AOG GCT AAA AAG AAA CCA CAA GAC GCT CAG AAG AAA TAT CAT GAG CAT CAG 

dyktaxkxaedaqxxydei><£ 

330 230 340 350 aco 370 

* * * * • * 

AAG AAA ACT GAG GCA AAA GCG GAA AAA GAA AGA AAA GCT TCT GAA AAG ATA GCT GAG GCA 
XXT£AKAEX£RKA5EKZAEA> 

380 390 400 410 . 420 430 

* * * * * 

ACA AAA GaA GTT CAA CAA GCG TAG CTA GCT TAT CTA CAA GCT AGC AAC GAA AGT CAG AGA 
TKEVQQAY LAYLQASIIK & Q R> 

440 450 4(0 470 Afifl 490 

AAA GAG GCA GAT AAG AAG ATA AAA GAA GCT ACG CAC CCA AAG ATG AGO CGG ACO TOC AAT 

KEAD K2CIXEATHAJCKRRTCII> 

* 1 

500 510 520 530 540 550 

* * » * * 

TTG ACT ATC GAA TTC GAA CAA CAA TTG TAC TTC CTG AAC CAA GTG ACT TAC CTG AGA CTA 

560 570 580 590 600 610 

* * * + * 
AGA AAA AAG CAG AAG AGO CAA CAA AAG AAG CAG AAG TAT CTA AGA AAA AAT CTG AAG AGO 

«KKQKBQQXXQKY1.RXKLXR> 

w 2 €30 * 40 «50 640 670 

CAG CTA AAG AGO TAT AAG TAT AGA AAA ATA AAA TAC TTG AAC AAG ATG CTG AAA ACG AAA 
«I.KRYKYmKZXYX.MjCMX,XTK» 



400 €90 700 710 720 



730 



AGA AAA TTC AOG TAC TTC AAA ACA AAG TCG CTG ATT TAT AAA AAG GAA TTG CTC TCC ATC 
KKfc «YPXTXSLlYXXXX,LSX> 

74 J> 750 760 770 7S0 790 

**** CAA, AGC GAT TTA AAA CAT OCT 
VAXLNXHIA XI,qsDLKDA> 

80 2 S1 2 WO 630 040 ft50 

^ ^ T ^ X ^ OS? *2F 001 ™ « ATC ACT AAT AAA 

BEH NVEI>TXXBCI,XQAITNX> 

•« 070 B80 890 ,00 910 



F/Cr 13 £W f) 
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AAA GCT GAA TTA GCT ACA ACT CAA CAA AAC ATA GAT AAA ACT CAA AAA GAT TTA GAG GAT 
KABr,ATTQQWlDKTQKDLBB> 



920 



930 



940 950 $60 970 

* * • * 

GCT GAA TTA GAA CTT GAA AAA GTA TTA GCT ACA TTA GAC CCT GAA GOT AAA ACT OVA GAT 
ABLELEKVLATLDP £GKTQD> 



980 



990 



1000 
* 



10X0 



1020 



1030 
* 

GAA TTA GAT AAA GAA GCT GCT GAA GCT GAG TTC AAT GAA AAA GTT GAA GCT CTT CAA AAC 
ELDKEAABAZLNEKVEALQ 



1040 
* 



1050 



10€0 



CAA OTT OCT CBAA TTA CAA GAA OAA CTT 
QVAB LEEZL 



1070 1080 1090 

* * * 

TCh. AAA CTT GAA GAT AAT CTT AAA OAT OCT 
SXLEDKLKDAE> 



1100 



1110 
* 



1120 



1130 



1140 1150 

ACA AAC AAC GTT GAA GAC TAC AIT AAA GAA CGT TTA GAA GAA GCT ATC GOG ACT AAA AAA 
TNNVBDYI XEGLEEAIATKX> 



1U0 



1170 



1180 
* 



1190 



1200 



1210 



CCT CAA TTO GAA AAA ACT CAA AAA GAA TTA CAT CCA CCT CTT AAT CA& TTA CCC OCT CAT 
AELBXTQXBLDAALBEl,CPD> 



1220 
* 



1230 



1340 
* 



1350 



1250 
ft 



1270 



GGA GAT GAA GAA GAG ACT CCA GOG CCG GCT CCT CAA OCA CAA AAA CCA GCT GAA GAG CCT 
GDEEETPAPA PQPEKFAEBP> 



1250 



1390 



13 00 
* 



1310 



1320 
* 



1330 



GAG AAT CCA GCT CCA CCA CCA AAA CCA GAG AAG TCA OCX GAT CAA CAA CCT GAA GAA GAC 
EWFAPAPK J?E KSAX>QQAESD> 



1340 



1350 



1360 



1370 
* 



1380 



1390 



TAT GCT CCT ACA TCA GAA GAA GAA TAT AAT CGC TTC ACC CAA CAG CAA CCG CCA AAA GCA 
YAftR.SEEEYHftL'lTQQQPPK^ 



1400 



1410 



1420 



1430 
* 



1440 



1450 



GAA AAA CCA GCT CCT CCA CCA CAA CCA GAG CAA CCA GCT CCT GCA CCA AAA ATA GAG GC 
EXFAPA?QPEQPA?APKXEA> 




20/43 



WO 97/09994 



PCT/US96/14819 



MC27 MC28 



Top of Gel -J-" 
190 kDa-i * 
82 kDa-p- 
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LSMpspA13/2 



LSMpspA12/6 



9.1 kb-i 



7.7 kb- 





4.2 kb 
3.6 kb 




X 



in 
cs 
U 

2 



vD 

2 



h- 00 

CN CM 
U CJ 



ID *D N M 

C N N N (N 
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Primer LSM13: gcaagcttatgatatagaaatttgtaac 
Primer LSM2 : gcgcgtcgacggcttaaacccattcaccattgg 

Probe LSMpspA13/2 (from RX1 sequence) : 
aagcttatga tatagaaatt tgtaacaaaa atgtaatata aaacacttga 
caaatattta cggaggaggc ttatacttaa tataagtata gtctgaaaat 
gactatcaga aaagaggtaa atttagatga ataagaaaaa aatgatttta 
acaagtctag ccagcgtcgc tatcttaggg gctggttttg ttgcgtctca 
gcctactgtt gtaagagcag aagaatctcc ■ cgtagccagt cagtctaaag 
ctgagaaaga ctatgatgca gcgaagaaag atgctaagaa tgcgaaaaaa 
gcagtagaag atgctcaaaa ggctttagat gatgcaaaag ctgctcagaa 
aaaatatgac gaggatcaga agaaaactga ggagaaagcc gcgctagaaa 
aagcagcgtc tgaagagatg gataaggcag tggcagcagt tcaacaagcg 
tatctagcct atcaacaagc tacagacaaa gccgcaaaag acgcagcaga 
taagatgata gatgaagcta agaaacgcga agaagaggca aaaactaaat 
ttaatactgt tcgagcaatg gtagttcctg agccagagca gttggctgag 
actaagaaaa aatcagaaga agctaaacaa aaagcaccag aacttactaa 
aaaactagaa gaagctaaag caaaattaga agaggctgag aaaaaagcta 
ctgaagccaa acaaaaagtg gatgctgaag aagtcgctcc tcaagctaaa 
atcgctgaat tggaaaatca agttcataga ctagaacaag agctcaaaga 
gattgatgag tctgaatcag aagattatgc taaagaaggt ttccgtgctc 
ctcttcaatc taaattggat gccaaaaaag ctaaactatc aaaacttgaa 
gagttaagtg ataagattga tgagttagac gctgaaattg caaaacttga 
agatcaactt aaagctgctg aagaaaacaa taatgtagaa gactacttta 
aagaaggttt agagaaaact attgctgcta aaaaagctga attagaaaaa 
actgaagctg accttaagaa agcagttaat gagccagaaa aaccagctcc 
agctccagaa actccagccc cagaagcacc agctgaacaa ccaaaaccag 
cgccggctcc tcaaccagct cccgcaccaa aaccagagaa gccagctgaa 
caaccaaaac cagaaaaaac agatgatcaa caagctgaag aagactatgc 
tcgtagatca gaagaagaat ataatcgjctt gactcaacag caaccgccaa 
aagctgaaaa accagctcct gcaccaaaaa caggctggaa acaagaaaac 
ggtatgtggt acttctacaa tactgatggt tcaatggcga caggatggct 
ccaaaacaac ggttcatggt actacctcaa cagcaatggt gctatggcta 
caggttggct ccaatacaat ggttcatggt attacctcaa cgctaacggc 
gctatggcaa caggttgggc taaagtcaac ggttcatggt actacctcaa 
cgctaatggt gctatggcta caggttggct ccaatacaac ggttcatggt 
attacctcaa cgctaacggc gctatggcaa caggttgggc taaagtcaac 
ggttcatggt actacctcaa cgctaatggt gctatggcta caggttggct 
ccaatacaac ggttcatggt actacctcaa cgctaacggt gctatggcta 
caggttgggc taaagtcaac ggttcatggt actacctcaa cgctaatggt 
gctatggcaa caggttgggt gaaagatgga gatacctggt actatcttga 
agcatcaggt gctatgaaag caagccaatg gttcaaagta tcagataaat 
ggtactatgt caatggttta ggtgcccttg cagtcaacac aactgtagat 
ggctataaag tcaatgccaa tggtgaatgg gtttaagccg 
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Primer LSM12: ccggatccagcgtcgctatcttaggggctggtt 
Primer LSM6: ctgagtcgactggagtttctggagctggagc 

Probe LSMpspA12/6 (from RX1 sequence) : 

ccagcgtcgc tatcttaggg gctggttttg ttgcgtctca gcctactgtt 
gtaagagcag aagaatctcc cgtagccagt cagtctaaag ctgagaaaga 
ctatgatgca gcgaagaaag atgctaagaa tgcgaaaaaa gcagtagaag 
atgctcaaaa ggctttagat gatgcaaaag ctgctcagaa aaaatatgac 
gaggatcaga agaaaactga ggagaaagcc gcgctagaaa aagcagcgtc 
tgaagagatg gataaggcag tggcagcagt tcaacaagcg tatctagcct 
atcaacaagc tacagacaaa gccgcaaaag acgcagcaga taagatgata 
gatgaagcta agaaacgcga agaagaggca aaaactaaat ttaatactgt 
tcgagcaatg gtagttcctg agccagagca gttggctgag actaagaaaa 
aatcagaaga agctaaacaa aaagcaccag aacttactaa aaaactagaa 
gaagctaaag caaaattaga agaggctgag aaaaaagcta ctgaagccaa 
acaaaaagtg gatgctgaag aagtcgctcc tcaagctaaa atcgctgaat 
tggaaaatca agttcataga ctagaacaag agctcaaaga gattgatgag 
tctgaatcag aagattatgc taaagaaggt ttccgtgctc ctcttcaatc 
taaattggat gccaaaaaag ctaaactatc aaaacttgaa gagttaagtg 
ataagattga tgagttagac gctgaaattg caaaacttga agatcaactt 
aaagctgctg aagaaaacaa taatgtagaa gactacttta aagaaggttt 
agagaaaact attgctgcta aaaaagctga attagaaaaa actgaagctg 
accttaagaa agcagttaat gagccagaaa aaccagctcc agctccagaa 
actccag 
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1 2 34 5678 



2.1 kb 



1.4 

1.1 

1.0 
0.9 

0.8 

0.7 



kb 
kb 
kb 
kb 

kb 
kb 



0.4 kb 
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SKH2 5' CAT ACC gTT TTC TTg TTT CCA gCC -3" 

LSM13 5' gCA AgC TTA TgA TAT AgA AAT TTg TAA C -3" 

N192 5' ggAAggCCATATgCTCAAAgAgATTgATgAgTCT -3' 

C588 5' CCAAggATCCTTAAACCCATTCACCATTggC -3* 
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Figure » Comparison of the structural motifs of PspA and PspC. 
PspA has a smaller a-helical region, and does not contain the direct 
repeats within the a-helix (indicated by the dashed lines). The a- 
helical regions which are homologous between PspA and PspC are 
indicted by the striped pattern, PCR primers are indicated by the 
arrows. 
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249 SDSSVGECTLPSPSLNMANE5QTEHRKDVDEYIK 298 

. ; | . | : . : . : : | ' : | . | : : . | | . : - - . : | . . : : 

. 1 EES PVASQSKAEKDYDAAKKDAKNAKKAVZD . AQKALDDAKAAQKKYDED 49 

• » * » - 
299 VNLNI KLSAIKTKYLYELSVLKENSKKEELTSKTKAELTAAr EQFKKDTL 348 

■ w * * i * I** * i i * *i* i** * i * «•*..• i 

50 QKXTEEXAAL EKAASEEM . DKAVAAVQQAYLAYQQATD 86 

m • - • * . 

349 KPEKKVAEAEKKVEEAKKKAKDQKEEDRRNYPTbTl^ 398 

| ; . | • . | : ] . : : | | | | : . . : • | . . ) | : - : : 
87 KAAK. .DAADKMIDEAKKREEEAKTK FNTVRAMW 119 

***** 

399 KEAELELVKEEANESRNEEKIKQAKEKVESKKAEATRLEKIKTDRKKAEZ 448 
120 PEPEQLAETKXKSEEAKQKAPEL TKKLE 147 

a » « • • 

449 EAKHKAEESEKKAAEAKQKVDAEEYALEAK^ 498 

Ml I MMMMMMMMM I :||lllll = hlll-lllllli 

148 EAXAKLEEAEKXATEAKQKTOAEEVAPQAKIAELENQVHRLEQELKEIDE 197 

# * * V * 

499 SDSEDYLKEGIJlAPLQSKLiyrKKAKLSKLEELSDKIDELDAEIAKLEVQL 548 

MUM llhllMMIII-INMIIIIIIIIII llillil MM 1 1 

198 SESEOYAKEGFRAPLQSKLDAKKAKLSKLEELSDKIDELDAEIAiCLEDQL 247 

» • • » * 

549 KDAEGNNNVEAYFKSGLEKTTAEXXAELE^ 598 

l-lhllllMlllllillMMIllllMlllllMMIIMIIII: 

243 KAAHENNNVSDYFKSGLEXTIAAKKAa^ 297 
599 PAPAPEKPAE - . XPAPAPEX . PAPAPEKPA . . PAPEXPAPA 634 

.Mil III IIIIIM- III 1 1 1 1 1 I III — 

298 .TPAPEAPAZQPXPAPAPQPAPAPKPEXPAEQPKPEXTDDQQAEEDYAHK. 346 



625 PEXP A? APEKPAPT P ETPK7GWKQ ENGMWY7YNTDGS51ATGW 676 

■ --I MINI MLMMMMM M M M M 

347 SEEEYKBLTQCQ?S , KAEK?AP. . .APXTGWKQETOMWYrYNTOGS 388 



677 LQTOKSSWCTLNSNGA^TC^ 726 

MMMM MMMMMMM MMM MMMI 

389 I^TGWLQl^GSWYYl^SNGA^TG;-nOYNGS\VYYI. 423 

***** 

727 NANGDl^TGWLQYNGSWYYLNAKGDMATC^QYNG 776 

IMMIMM - MMMMMMMMMMMMMMM MM Ml 

424 NANGAMATGWAXVNGSWYYIJNANGAMAT^^ 473 

* + £ * * * 

777 FQYNG SWYYLNANGDMATGWLQYNG SWYYLNSNGAMVTG*»fLQNNGSWYYL 826 

* MMMIMMi MMMM1MIMIMMMMM ! - IIIIIM 

474 AXVNGSWYYIATWGAMATGWLQYNGSWYYl^M^ 523 

* •■ » • * 

827 NAN^SHATDWVKDGDTrfYYLEJ^ 876 

MMMMM MMMMIM M M IMMMMMM M M I MMM 

524 NANGAMATGWVXDGDTWTifLEASGAMKASQWFXVSDK;^ 573 



877 TTVDSYRVNAKGEWV 891 

lllhhllllllll 
574 TTVDGYKVNANGEVJV 588 

Figure X, Bestfit analysis of PspA and PspC. Percent identity 
is 69% and percent similarity is 77%. Amino acids of PspA are on 
the bottom line (1-588) and amino acids of PspC are on the top 
line (249-891) . A dashed line indicates identity. 
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Figured Hie coiled-coil motif of the a helix of PspC 
Amino acids that are not in the coiled-coii motif are in the right column. 
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Sequence of the alpha helical and proline region of LXS532 (PspC.D39) . 
Nucleotides 1-516 include the upstream region, noncoding region. The 
alpha helical region is underlined (bp 517-1112) and the proline region 
is italicized (1113-1326) . 



1 


CCAAGCTATT 


AGGTGACACT 


ATAGAATACT 


CAAGCTATGC 


ATCAAGCTTA 


51 


TGCTTGTCAA 


TAATCACAAA 


TATGTAGATC 


ATATCTTGTT 


TAGGACAGTA 


101 


AAACATCCTA 


ATTACTTTTT 


AAATATTCTT 


CCTGAGTTGA 


TTGGCTTGAC 


151 


CTTGTTGAGT 


CATGCTTATG 


TGACTTTTGT 


TTTAGTTTTT 


CCAGTTTATG 


201 


CAGTTATTTT 


GTATCGACGA 


ATAGCTGAAG 


AGGAAAAGCT 


ATTACATGAA 


251 


GTTATAATCC 


CAAATGGAAG 


CATAAAGAGA 


TAAATACAAA 


ATTCGATTTA 


301 


TATACAGTTC 


ATATTGAAGT 


AATATAGTAA 


GGTTAAAGAA 


AAAATATAGA 


351 


AGGAAATAAA 


CATGTTTGCA 


TCAAAAAGCG 


AAAGAAAAGT 


ACATTATTCA 


401 


ATTCGTAAAT 


TTAGTATTGG 


AGTANCTAGT 


GTAGCTGTTG 


CCAGTCTTGT 


451 


TATGGGAAGT 


GTGGTTCATG 


CSACCAGARA 


AACGAKGGAA 


GTACCCAAGC 


501 


AGCCMCTTCT 


TCTAATATGG CAAAGACAGA ACATAGGAAA GCYGCTAAAC 



551 MAGTCGTCGA TGAA TATATA GAAAAAATGT TGAGGGAGAT TCAACTAGAT 

601 AGAAGAAAAC ATAC CCAAAA TGTCGCCTTA AACATAAAGT TGAGCGCAAT 

651 TANAACGAAG TAT TTGCGTG AATTAANTGT TNTAGAAGAG AAGTCGAANN 

701 ATGACfTTGCC GTC AGAAATA AAAGCGAAGT T AG ACGCCGC TTTTGANAAG 

751 TTTAAAAAAG ATAC ATTGAA ACCAGGAGAA AAGGTAGCNG AAGCTAAGAA 

80; GAANG TTGAA GAAGCTAAGA AWAAAGCCRA GGATCAAAAA GAAGAAGATC 

851 GYCGTAACTA CCCA ACCAAT ACTTRCAAAA CGCTTGACCT TGAAATTGCT 



901 GAGTYCGATG TGAAAGTTAA AGAAGCGGAG CTTGAACTAG TAAARGAGGA 

951 AGCT MMRGAA YCTCGAGACG AGGAAAAAAT TAAGCAAGCA AAAGCGAAAG 

1001 TTGAGAGTAA AAAAGCTGAG GC TACAAGGT TAGAAAACAT CAAGACAGAT 

1051 NGTAA AAAAG CAGAAGAAGA AGNTAAACGA AAAGCAGCAG AAGAAGATAA 

1101 AGTTAAAGAA AAACCAGCTG AA CAACCACA ACCAGCGCCG GNTACTCAAC 

1151 CAGAAAAACC AGCTCCAAAA CCAGAGAAGC CAGCTGAACA ACCAAAAGCA 

1201 GAAAAAACAG ATGATCAACA AGCTGAAGAA GACTATGCTC GTAGATCAGA 

1251 AGAAGAATAT AATCGCTTGA NTCAACAGCA ACCGCCAAAA ACTGAAAAAC 



fig. 2S 
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1301 CAGCACAACC ATNTACTCCA AAAACA 



PCIYUS96/14819 



Fig. 2S Ctoo^nufd) 
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Comparison of nucleotides of pspA.Rxl to pspC.D39. PspA is the top line 
(bp 875- 1322) and pspC is the bottom line (bp 877- 1326). The region 
which is most homologous includes the nucleotides which encode the 
proline region (bp 1113 pspC, bp 1128 pspA) . 

Percent Similarity: 77.083 Percent Identity: 74.769 



875 AAAAAGCTAAACTATCAAAACTTGAAGAGTTAAGTGATAAGATTGATGAG 924 

I I I I Ml II I III III lllh II || II I II 
877 AAAACGCTTGACCTTGAAA . . TTGCTGAGTYCGATGTGAAAGTTAAAGAA 924 



925 TTAGACGCTGAAATTGCAAAACTTGAAGATCAACTTAAAGCTGCTGAAGA 974 

II I I I I I I llh I I I I h:: I II || 

925 GCGGAGCTTGAACTAGTAAARGAGGAAGCTMMRGAAYCTCGAGACGAGGA 974 



975 AAACAATAATGTAGAAGACTACTTTAAAGAAGGTTTAGAGAAAACTATTG 1024 

in i in ii i i i iii in mi 

975 AAAAATTAAGCAAGCAAA AGCGAAAGTTGAGAG 1007 



1025 CTGCTAAAAAAGCTGA ATTAGAAAAAACTGAAGCTGACCTT 1065 

IIIMMIIIII MINIM | | | || : | 

1008 TAAAAAAGCTGAGGCTACAAGGTTAGAAAACATCAAGACAGATNGT 1053 



1066 AAGAAAGCAGTTAATGAGCCAGAAAAACCAGCTCCAGCTCCAGAAACTCC 1115 

n r 1 1 1 1 1 1 i ii ih 1 1 1 1 i mi mi i 

1054 AAAAAAGCAG. . - AAGAAGAAGNTAAACGAAAAGCAGCAGAAGAAGATAA 1100 



1116 AGCCCCAGAAGCACCAGCTGAACAACCAAAACCAGCGCCGGCTCCTCAAC 1165 

n mi 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 iiiiiiimihi mini 

1101 AGTTAAAGAAAAACCAGCTGAACAACCACAACCAGCGCCGGNTACTCAAC 1150 
1166 CAG. . .CTCCCGCACCAAAACCAGAGAAGCCAGCTGAACAACCAAAACCA 1212 

Ml II II lllllllllllllllllllllllllllllllll II 

1151 CAGAAAAACCAGCTCCAAAACCAGAGAAGCCAGCTGAACAACCAAAAGCA 1200 



1213 GAAAAAACAGATGATCAACAAGCTGAAGAAGACTATGCTCGTAGATCAGA 1262 

IMII II IIMIIIIIIIIIIMIIII I Mlllll IIIIIIMIIMM I 

1201 GAAAAAACAGATGATCAACAAGCTGAAGAAGACTATGCTCGTAGATCAGA 1250 



1263 



1312 



AGAAGAATATAATCGCTTGACTCAACAGCAACCGCCAAAAGCTGAAAAAC 

1 1 1 f 1 1 1 1 1 f 1 1 1 ( 1 1 1 1 1 1 ^ 1 1 1 1 1 1 1 1 1 1 f i 1 1 f r t r f iiiiiiiii 

1251 AGAAGAATATAATCGCTTGANTCAACAGCAACCGCCAAAAACTGAAAAAC 1300 

1313 CAGCTC CTGCACCAAAAACA 1332 

Mil I :( I IIIIIIIII 

1301 CAGCACAACCATNTACTCCAAAAACA 1326 



fig. 2k 
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BESTFIT pspC.EF6796 and pspC.D39, pspC of EF6796 is the top line (bp 1- 
bp 1042)and pspC of D39 is the bottom line (bp 44 - bp 1087). The 
sequences are highly homologous in the upstream noncoding region and the 
DNA encoding the proline region. 

Percent Similarity: 88.322 Percent Identity: 86.065 

* • * • * 

1 AAGCTTATGCTTGTCAATAATCACAAATATGTAGATCATATCTTGTTTAG 50 

lllllllllllllllillllllllllllllllllllllllllllllllll 

44 AAGCTTATGCTTGTCAATAATCACAAATATGTAGATCATATCTTGTTTAG 93 

* - . » 

51 GACAGTAAAACATCCTAATTACTTTTTAAATATTTTACCTGAGTTGATTG 100 

F ) 1 1 1 f I ! f 1 1 1 1 1 1 1 ! 1 1 1 1 i 1 1 1 1 i I E 1 1 1 1 1 | 1 1 1 1 i 1 1 1 M 1 1 1 

94 GACAGTAAAACATCCTAATTACTTTTTAAATATTCTTCCTGAGTTGATTG 143 

* • • » « 

101 GCTTGACCTTGTTGAGTCATGCCTATATGAOTTTTGTTTTAGTTTTTCCA 150 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 f in 1 1 1 1 1 1 r 1 1 e 1 1 j 1 1 1 1 1 1 1 1 r i 

144 GCTTGACCTTGTTGAGTCATGCTTATGTGACTTTTGTTTTAGTTTTTCCA 193 

» ■ » a . 

151 GTTTATGCAGTTATTTTGTATCGACGAATAGCTGAAGAGGAAAAGTTATT 200 

MINI lllllll IIIIIIIIIMIMIIIIIIMIIIIimil Mil 

194 GTTTATGCAGTTATTTTGTATCGACGAATAGCTGAAGAGGAAAAGCTATT 243 



201 ACATGAAGTTATAATCCCAAATGGAAGCATAAAGAGATAAATACAAAATT 250 

Ml I II IIIIIIMI MMII I Mill I II Mill III M I II III II II 

244 ACATGAAGTTATAATCCCAAATGGAAGCATAAAGAGATAAATACAAAATT 293 

■ * * m • 

251 CGATTTATATACAGTTCATATTGAAGTGATATAGTAAGGTTAAAGAAAAA 300 

iii i ii ii iii ii ii mi ii mi n mi i limn i mil mi 

294 CGATTTATATACAGTTCATATTGAAGTAATATAGTAAGGTTAAAGAAAAA 343 

- ♦ . . 

301 ATATAGAAGGAAATAAACATGTTTGCATCAAAAAGCGAAAGAAAAGTACA 350 

MM MM III II I MMII II MM II MM III Mill I Mil Mill 

344 ATATAGAAGGAAATAAACATGTTTGCATCAAAAAGCGAAAGAAAAGTACA 393 

* • • . . 

351 TTATTCAATTCGTAAATTTAGTATTGGAGTAGCTAGTGTAGCTGTTGCCA 400 

1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 ^ i r 1 1 1 r 1 1 1 1 1 1 1 1 f 1 1 1 

394 TTATTCAATTCGTAAATTTAGTATTGGAGTANCTAGTGTAGCTGTTGCCA 443 

401 GCTTGTTCTTAGGAGGAGTAGTCCATGCAGAAGGGGTTAGAAGTGGGAAT 450 

I I I I III I II II Mill I I | | | :|||l 
444 GTCTTGTTATGGGAAGTGTGGTTCATGC. . SACCAGARAAACGARGGAAG 491 

451 AACCTCA CGGTTACATCTAGTGGGCAAGATATATCGAAGAAGTATG 496 

nil iii mi i i i i i i i i i i i 

492 TACCCAAGCAGCCMCTTCTTCTAATATGGCAAAGACAGAACATAGGAAAG 541 

* • • • • 

497 CTGATGAA GTCGAGTCGCATCTAGAAAGTATATTGAAGGATGTC 540 

Ml I II Mill II 1 1 1 1 1 1 II 1 1 1 1 IN | 

542 CYGCTAAACl^GTCGTCGATGAATATATAGAAAAAATGTTGAGGGAGATT 591 
541 AAAAAAAATTTGAAAAAAGTTCAACATACCCAAAATGTCGGCTTAATTAC 590 

ii i ii i ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 inn i 

592 CAACTAGATAGAAGAA AACATACCCAAAATGTCGCCTTAAACAT 635 

* 

591 AAAGTTGAGCGAAATTAAAAAGAAGTATTTGTATGACTTAAAAGTTA 637 
I I I I 1 t I I I I I llllhll I I I I I I I I 1 I HI INI: llh' 
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636 AAAGTTGAGCGCAATTANAACGAAGTATTTGCGTGAATTAANTGTTNTAG 685 

• • * » * 

638 ATGTTTTATCGGAAGCTGAGTTGACGTCAAAAACAAAAGAAACAAAAGAA 687 

I I III I- lllllll I I I I lllll Mill 

686 AAGAGAAGTCGAANNATGAGTTGCCGTC AGAAATAAAAGCG 726 

■ • * ■ * 

688 AAGTTAACCGCAACTTTTGAGCAGTTTAAAAAAGATACATTACCAACAGA 737 

llllll Ml Mlllll: III I I I I Ml I I I II I I I I I 
727 AAGTTAGACGCCGCTTTTGANAAGTTTAAAAAAGATACATT GAA 770 

* * • * » 

738 ACCAGAAAAAAAGGTAGCAGAAGCTCAGAAGAAGGTTGAAGAAGCTAAGA 787 

Mill I I I II lllll h I lllll MM llhlll I I I I MM I I Ml 
771 ACCAGGAGAAAAGGTAGCNGAAGCTAAGAAGAANGTTGAAGAAGCTAAGA 820 

* ■ * w * 

788 AAAAAGCCGAGGATCAAAAAGAAAAAGATCGCCGTAACTACCCAACCATT 837 

MM I II Ml II MM I II M M 1 1 II 1 1 1 = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

821 AWAAAGCCRAGGATCAAAAAGAAGAAGATCGYCGTAACTACCCAACCAAT 870 
838 ACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAA 887 

iiihiiui mini iiiiiiiiiiiiiiihiiiiiii mini 

871 ACTTRCAAAACGCTTGACCTTGAAATTGCTGAGTYCGATGTGAAAGTTAA 920 
888 AAAAGCGGAGCTTGAACTAGTAAAAGTGAAAGCTAAGGAATCTCAAGACG 937 

I II I II II I II I M II M II MMI I IMIM-MMIM Mill 

921 AGAAGCGGAGCTTGAACTAGTAAARGAGGAAGCTMMRGAAYCTCGAGACG 970 

• • • * 

938 AGGAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAGAGTAAACAAGCTGAG 987 

1 1 1 1 mi ii i im i Mm i iiiii miiimmii mimi 

971 AGGAAAAAATTAAGCAAGCAAAAGCGAAAGTTGAGAGTAAAAAAGCTGAG 1020 

■ • # * . 

988 GCTACAAGGTTAAAAAAAATCAAGACAGATCGT GAAGA 1025 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllllllllhll Mill 

1021 GCTACAAGGTTAGAAAACATCAAGACAGATNGTAAAAAAGCAGAAGAAGA 1070 

1026 AGCTAAACGAAAAGCAG 1042 

I I : I I I III III III I I 
1071 AGNTAAACGAAAAGCAG 1087 
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Amino acid comparison of PspC of EF6796 and D39. The size of the PspC 
molecule of D39 is 1/3 the size of PspC of EF6796. PspC.D39 does not 
appear to conatain a signal sequence. 

Percent Similarity: 71*212 Percent Identity: 67.803 



306 SQTEHRKD VDEYIKKMLSEIQLDRRKHTQNVNLNIKLSAIKTKYLY 351 

-■Hill- I t I I I - I 1 I - I I I I t I I 1 I I I I I - I I I 1 I I I I | | | | 

2 AKTEHRKAAKXWDEYI EKMLREIQLDRRKHTQNVALNI KLSAIXTKYLR 51 

• • . » m 

352 ELSVLKENSKKEELTSKTKAELTAAFEQFKKDTLKPEKKVAEAEKKVEEA 401 

II I -I I M- h ll ■ l-l II -11111111-11111.1 INI 

52 ELXVXEEKS . XXELPSEIKAKLDAAFXKFKKDTLKPGEKVAEAKKXVEEA 100 

. * . • . 

402 KKKAKIXJKEEDRRNyPTNTYKTLELEIAESDVKVKEAELELVKEEANESR 451 

I II I I I I I I I I I I I I I I II hi III I IIIIIIIIIIIIIMI I I 
101 KXKAXIX^KEEDRRNYPTNTXKTI^ 150 

* • • * m 

452 NEEKIKQAKEKVESKKAEATRLEKIKTDRKKAEEEAKRKAEESEKKAAEA 501 

IIIIMIM IMMMIMIII MINI ||||.|-:| 
151 DEEKIKQAKAKVESKKAEATRLENIKTDXKKAEEEXKRKAAEEDK 195 



552 SKLDTKKAKLSKLEELSDKIDELDAEIAKLEVQLKDAEGNNNVEAYFKEG 601 

I- Ml- 

196 VKEKPAEQ 203 

602 LEKTTAEKKAELEKAEADLKKAVDEPETPAPAPQPAPAPEKPAEKPAPAP 651 

.:|.. .-:.-||| IIMH-I 

204 PQPAPXTQPEKPAPKPEKPAEQPKAEK 230 

• ■ . • . 

652 EKPAPAPEKPAPAPEKP . APAPEKPAPAPEKPAPTPETPKT 691 

I- -I-- :--:| .||||. | Mil 

231 TDDQQAEEDYARRSEEEYNRLXQQQPPKTEKPAQ . PXTPKT 270 
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Comparison of the amino acids of PspC.D39 and PspA.Rxl. There is little 
homology except in the proline region. PspA.Rxl is the top line (aa 91 
- aa 444); PspC.D39 the bottom line (aa 2 - aa 270). 

Percent Similarity: 56.767 Percent Identity: 42.857 



91 AKKDAKNAKKAVEDAQKALDDAKAAQKKYDEDQKKTEEKAALEKAASEEM 140 
| | . : : • | | | | : . : . . : | . : | . : . . | | : | . 

2 AKTEHRKAAKXWD EYIEKMLREIQLDRRKHTQNVALNXKLSAIX 46 

• • • * . 

141 DKAVAAVCX3AYLAYQQATDKAAKDAADKMIDEAKKREEEAKTKFNTVRAM 190 
*| I"" • • • J | • * • I || • 

47 TK YLRELXVXEEKSXXELPSEIKAKLDAAFXKF. . _ KKD 82 

• » * * » 

191 WPEPEQIAETKKKSEEAKQKAPELTKKLEEAKAKLEEAEKKATEAKQKV 240 
. : . - . | . : | | . | | I I I I I I : - I h - : | : . - - : 

83 TLKPGEKVAEAKKXVEEAKXKAXD QKEEDRRNYPTNTXKTL 123 

. . * . . 

241 DAEEVAPQAKIAELENQVHRLEQELKEIDESESEDYAKEGFRAPLQSKLD 290 

| | . . : . | : | | : : *- : | | * • I * • M • I : - 

124 DLEIAEXDVKVKEAELEL . . VKEEAXEXRDEEKXKQAK. . AKVE 163 

• • « « . 

291 AKKAKLSKLEELSDKIDELDAEIAKLEDQLKAAEENNNVEDYFKEGLEKT 340 

-III- -:||:: 

164 SKKAEATRLENI 175 

• « • * . 

341 IAAKKAELEKTEADLKKAVNEPEKPAPAPETPAPEAPAEQPKPAPAPQPA 390 

.|-|.s h --|-=| ..| IIMUM -||. 

176 KTDXKKAEEEXKRKAAEEDK VKEKPAEQPQPAPXTQPE 213 

• . • * . 

391 . PAPKPEKPAEQPKPEKTDDQQAEEDYARRSEEEYNRLTQQQPPKAEKPA 439 

1 1 MM MM MMM III 1 1 II I II 1 1 IMM MM llllll. Mil 

214 KPAPKPEKPAEQPKAEKTDDQQAEEDYARRSEEEYNRLXQQQPPKTEKPA 263 

440 PA. . PKT 444 
Ml 

264 QPXTPKT 270 
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